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(57) Abstract: An MMS communication 
system for displaying images on a display 
terminal of a mobile or portable communica- 
tion device, the system comprising: an input 
adapted to receive pre-source information^ 
transmitter adapted to transmit the pre-source 
information^ server adapted to receive the 
transmitted pre-source information and further 
adapted to convert the pre-source information 
to source information suitable for display on 
the display terminal; and a source transmitter 
adapted to transmit the source information to 
the display terminal. 
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MMS SYSTEM AND METHOD WITH PROTOCOL CONVERSION 
SUITABLE FOR MOBILE/PORTABLE HANDSET DISPLAY 

t==DESCRIPTION 

ikfti — Itetetedf Applications 
[l]This Application claims priority from co-pending U,S, Provisional 
Application Serial No, 6G/299*74S filed June 22* 2GG1* which is 
incorporated in its entirety by reference, 

[2]This disclosure teaches techniques related to an MMS (Multimedia 
Messaging System)* including images* graphics* numerics* and text* 
suitable for display on the display of a mobile or portable communication 
handset terminal, 

[3]To understand the disclosure better* the following definitions for technical 

terms used in this disclosure is provided: 
[4]1, EMS; Extended Messaging System* a source protocol used for EMS 

handsets* designed to encode multimedia messages* including images* 

graphics* numerics* animations* audio and formatted text, 
[S]2, MMS; A pre-source (multimedia ^formatting information) protocol used 

to encode types of messages* including images* graphics* numerics* and 

text* and transcoded for the display/phone speaker on various display 

terminals, Used in most Nokia handsets, 
CS]3, PM; Picture Messaging protocol* a graphic format (source protocol) 

used to display B/W images in Nokia handsets supporting the NSM per- 

source format, 

[?]4, Pre-source information; In this application* information* which may be 
a full multimedia message or some part thereof* which appears in a non- 
source format and is not coded in a source protocol, Pre-source 
information refers to "packaged* multimedia content in a *raw* format 
such as: 

t8]a. A set of TCP/IP packets composing a MIME multipart message (could be 
an email message or an MMS MM1 message)* where some parts of the 
message are media objects which need to be converted/transcoded* and 
some other parts (e,g, SMIL attachment) are presentation layer 
information relating to how the information has to be arranged and 
displayed, Rga, 24 and 2S illustrate this concept, 

[9]b, A block of SMS messages that together compose an EMS or a Nokia 
Smart Messaging (NSM) message and contain multiple media objects - 
pictures* ringtones* etc. These SMS messages are further encapsulated 
into the SMSC protocol which can be SMPP*UCP*CIMD etc, 

[10] 5, Smart Messaging; A source protocol being developed by Nokia for 
Nodia handsets, This refers to everything defined in the NSM pre-source 
protocol and adds functionality for calendar events (vCalendar), electronic 
business cards (vCard) etc. 
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til] 6. Source information; In this application, information, which may be a 
full multimedia message or some part thereof, which appears in a source 
format and which is coded in a source protocol. Typical source protocols 
are WBMP, EMS, and PM, but new protocols are being developed on an 
ongoing basis. Source protocols enable the display of messages on 
terminals with limit memory, processing, and display capabilities, such as 
those of mobile and portable radio communication handsets (i.e., cellular 
telephones, land mobile radios, Instant Messaging terminals, radio 
enabled PDAs, and the like). On the other hand, source information 
constitutes media objects in some media format, e.g. a JPEG picture, an 
MP3 audio file, an AVI video etc. 

[12] 7. Transcoding; To perform protocol conversion, either from one 
source protocol to another, or from a pre-souree protocol into a source 
protocol. 

[13] 8. WAP; "Wireless Application Protocol*', one protocol used for what 
have been called 2,SG cellular systems. In the cellular world, 1G was the 
original set of analog cellular systems. 1G has been mainly displaced by 
2G systems, which are low-speed digital systems, which a typical raw data 
rate of 9.5kbps. Operators are currently deploying what are known as 
2.5G systems, which are higher speed digital systems, expected to 
operate up 384kbps. 2.5G systems are expected to be replaced by 3G 
systems, which are higher speed digital systems promising speeds up to 
2Mbps. WAP is one of the chief manifestations of 2,5G systems. WAP is a 
pre-souree protocol. 

[14] 9. WBMP: The display protocol for handsets in the WAP system. 

2. I ntroduction 

[15] The process of transcoding is not a new idea. Indeed, in forms the 
basis of communication systems. Even the conversion from analog to 
digital, or vice verse, is a form of transcoding. With the proliferation of 
higher speed digital cellular systems, the challenge and the problem of 
transcoding have become much greater. There is, as yet, no standard 
display protocol for higher speed communication terminals. Therefore, 
transcoding from one display protocol to another is required to insure that 
the receiving terminal will be able to display the transmitted. There is, 
however, no method or system to do this in such a way that the integrity 
and quality of tine transmitted message will be maintained in the display 
terminal. Further, there is no method or system for transcoding non- 
source information into a source protocol suitable for display on a 
communication terminal, while maintaining the integrity and quality of tine 
information which originally appeared in the non-source format. There 
are transcoding systems and methods, to be sure, but they are primitive, 
and lose much of the quality of the source or pro-source information, even 
to the point where in some cases the displayed information in the display 
terminal is not recognizable. What Is required is algorithms, a method, 
and a system, that will allow identification of the specific display 
characteristics of the target display terminal, and will also allow the source 
or pre-source information to be displayed on the target display terminal 
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with the maximum amount of integrity and quality in comparison to the 
pre-coded information, 
tfe=SUMMARY 

[16] The disclosed teachings provide for: 

[17] 1, Conversion of source information coded in a source format into a 
protocol suitable for transmission to and display on the terminal, A 
variety of new processing techniques are disclosed. Source information is 
typically coded in protocols such as WBMP (the protocol for Wireless 
Application Protocol, or "WAP*, systems), EMS, and PM. This information 
must be transcoded for display on different terminals, also using source 
protocols, but where the protocols and variations of the protocols are 
typically different between the input source and the display terminal, 

[18] 2. Conversion of pre-source information, that is, information which is 
coded but not in a source protocol, into a source protocol. For example, 
an ordinary digital picture will be transcoded Into the source protocol 
WBNP, It will be appreciated that information in source protocol will then 
be transcoded again into the target source protocol, as explained in 
introductory point 1 immediately above, 

[19] An MMS communication system for displaying images on a display 
terminal of a mobile or portable communication device, the system 
comprising: an input adapted to receive pre-source informationja 
transmitter adapted to transmit toe pre-source information^ server 
adapted to receive the transmitted pre-source information and further 
adapted to convert the pre-source information to source information 
suitable for display on the display terminal; and a source transmitter 
adapted to transmit the source information to toe display terminal. 



ttt5=BRlEF DESCRIPTION OF THE DRAWINGS 

[20] The above advantages of the disclosed teachings will become more 
apparent by describing in detail preferred embodiments thereof with 
reference to the attached drawings in which: 

[21] FIGS. 1-37 show various features of the disclosed teachings as 
described in the rest of this document, 

t¥i= D ETAILED DESCRIPTION 
¥&e&s=Over<aH Architecture. 

[22] An implementation of the disclosed teachings is shown in FIG.5, The 
structure includes the input devices 5,11-5,13 on the left, the server 5,2 
represented by the block in the middle, and the display devices 5.31-5,34 
shown on the right. Information may be in source format, as is the case 
for the cellular telephone 5.12 (picture of two people) and the digital 
camera attached to a cellular telephone 5.13 (picture of the automobile). 
Or information may be in a pre-source format, such as the cartoon of the 
man 5.11. At the server, the source information or pre-source 
information is processed by a variety of components, adapted to 



3 



WO 03/001770 



PCT/IB02/04148 



implement a variety of algorithms or techniques, which form an integral 
part of the disclosed teachings, 
[23] Some of the components mentioned above perform at least the 
following tasks: 

[24] 1. Format transcoding (from the p re-source information into source 

information, or from source information into other source information 

suitable for display on the display terminal) ; 
[25] 2. Image adaptation, to adapt the image to the particular display 

screen on the display terminal, (This is discussed in further detail in 

section IVG); 

[26] 3, Optimal compression for handset. The display terminal cannot 
display all of the bits of the original information. The information must be 
compressed, both for transmission and for display, 

[27] 4. Photo enhancement: Specific sections of a photograph may be 
cutout and enhanced, (This is discussed in further detail in section IVG); 

[28] S. Content based processing, by which different aspects of a 
multimedia message are identified and processed differently, (This is 
discussed in further detail in section IVG); 

[29] 6. Recognition from images: This is allied to the photo enhancement 
algorithm. Different portions of an image are recognized and "cutout* for 
enhancement, (This is discussed in further detail in section IVG); 

[30] 7. Interfaces to 3rd Party Applications: Third party applications may be 
processed separately and sent to the display terminals, or may be added 
to the original information. In addition, if there are software packages 
with additional algorithms for additional processing of the source 
information, these may abe accessed and applied to the original 
information for eventual display on the display terminals. An example of 
an interface to a third party application is an XML-based interface over 
TCP/IP, Another example of such an interface would be and API in C++ 
or Java. By third party application we are referring to both external SW 
modules, and to VAS - value added services - e,g, a news service, a 
gaming platform for cellular phones (for example 
www,wirelessgames,com, www,cash-u,eom), a photo album service. 
These applications wish to send MMS content and also in certain cases to 
perform special processing on such content prior to sending it. For a 
review of the special functionality please refer to section IV, H, 

[31] Another implementation of the disclosed teachings is shown in FIG,6, 
Examples of input sources, called "Content Sources*, 6,11-6,14, appear in 
the column at the right. However, the input sources are much broader 
than these pictures. At the bottom of the figure are various information 
devices 6,21-6,24 which can serve as both sources of information to the 
server 6,3, and also receivers of information processed by the server. 
These information sources can be WAP or i-Mode Phones, called *MMS 
Box* in the slide, (hMode phones are those that operate on i-Mode 2,5G 
cellular systems currently functioning in Japan, i-Mode phones will also 
operate on 3G systems expected to be introduced in Japan in late 2001 
and in 2002,) Picture messaging phones, operating with the PM protocol, 



4 



WO 03/001770 



PCT/IB02/04148 



are portrayed in the "Picture Box**. Similarly, EMS phones, operating with 
the EMS protocol, are portrayed in the *EMS Box* 1 , Finally, Email enabled 
phones are portrayed in the "E-mail Box*, In this implementation, 
information may be coded out of or coded into, any of the protocols show, 
Including WAP phones (WBMP protocol), I-Mode Phones (the Japanese 
version of WAP), Picture Messeging Phones (PM protocol), EMS Compliant 
Phones (EMS protocol), and E-mail Capable Phones (POP3, SMTP, and 
IMAP4 protocols) 

[32] The server will receive, transcode, and optimize for display on a 
specific communication terminal, at least any of the following protocols: 
IP, SMPP, TCP/IP, POP3/SMTP/IMAP4/XML, This is illustrated in FIG,7 

[33] Using the transcoding several tasks are accomplished, for example; 

[34] 1. The conversion from and into any of the various formats WBMP, PM, 
and EMS. 

(35] 2, The conversion of formats on the fly when the user logs on to his or 

her MMS box (so that even if the user were to switch terminal devices, he 

or she would get the correct content, properly formatted), 
[36] 3, The conversion taking into account the exact parameters (such as, 

for example, screen size, pixel dimensions, etc.) of the particular terminal 

and terminal display, 
[37] 4, Transcoding of source information into other source information 

suitable for display on a target terminal, maintaining integrity and quality 

of the original message, examples would be; 
[3B] a, A JPEG image is converted to a GIF image so that a phone with a 

WAP browser than can display GIF images will be able to view it, 
[39] b, A Nokia rlngtone is converted to an EMS IMelody ringtone so that a 

non-Nokia phone can play it, 
[40] c, A video in MPEG1 format is converted to MPEG-4 Simple Visual 

profile so that an MMS compliant phone can display it, or to an animated 

GIF sequence so that a non-MMS compliant (legacy) phone can view it, 
[41] d. Formatted text in the EMS format can be converted to an image or 

to HTML-*- text to preserve the formatting (underline, bold, letter size etc,), 
[42] It should be noted that both the EMS and NSM formats include not 

only images but also ringtones, animations and formatted text. This fact is 

well documented in the EMS/NMS standards documents for the last few 

years, 

[43] 5, Transcoding of non-source information into other source 

information, for later transcoding into source information for display on a 
specific target terminal, where all transcoding maintains the integrity and 
quality of the original message, 

[44] 6, Recognizing the specific display characteristics of a specific target 
display tormina!, to enable the display of high quality messages, whatever 
the characteristics of the terminal, 

IV.B. A n Example Implementation: Overview 
[45] FlG.8 shows another example implementation of the overall system. This implementation is called 
MMR. The MMR system allows users to send and receive messages containing text and images at least 
in the following formats / protocols: WAP; PM; EMS; MMS; E-MAIL; WEB; SMS. 
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[46] While the term message and information has been used in discussing the implementations in detail, it 
should be clear to a skilled artisan as to which message/information constitutes pre-source information 
and which constitutes source information according to the definitions for the same provided in the 
background. 

±. — User WAP Pages 

[47] The MMR provides a WAP based messaging application, allowing users to login to their personal 
messaging page. From this page users can view and send messages in variety of formats. The MMR 
sends the WAP recipient an SMS notification with a link to the newly received message. Alternatively, 
the MMR can send a WAP push message with the same link. MMS recipients receive a notification to 
the new MMS. PM and EMS recipients receive the message directly. Email recipients receive an e-mail 
with an image attachment to their regular e-mail address. 

[48] It should be noted that WAP pages can contain multimedia for immediate integrated display (e.g. a 
WBMP image), but can also contain downloadable multimedia such as higher resolution images, 
ringtones, animations etc. , as part of e.g. the M-Services standard for downloadable media. 
S=— User Web Pages 

[49] The MMR provides a WEB based portal for three major function. 

[50] L Users can register themselves to the system, by submitting personal information as well as 

information about the model of their mobile device. 
[51] 2. A photo album application is provided for personal storage and sharing of images and audio files. 

Users can login to their personal accounts, view and send messages to mobile device, in the same 

manner as described above. 
[52] 3. Users can also login to an HTML based messaging page which allows them to view all their received 

messages. Regardless of the format in which these messages were originally sent, these messages will 

be transcoded to be viewed on a normal web browser. This may allow users to view messages in a 

much higher quality than that seen on their mobile devices. 
a~MMS Module 

[53] The MMR can send and receive MMS messages to and from mobile devices. Incoming MMS messages 
are parsed and transcoded for optimal display on the recipient's device. Recipients of outgoing MMS 
messages receive a notification, allowing them to download the message from the MMS proxy. Other 
recipients receive the MMS message after it has been transcoded to WAP, PM , EMS , SMS or E-Mail. 
4^=*Email Module 

[54] The MMR allows users to send E-mail messages with image attachments to mobile devices. It also 
allows mobile users to send E-Mails with image attachments to regular e-mail addresses. Incoming E- 
maiis are parsed. The e-mail subject is sent as the message text, while each of the image attachments in 
the original e-mail are transcoded for the mobile device. Depending on the amount of attachments, the 
recipient may receive several messages, and a format most suitable to his mobile device. Outgoing e- 
mail messages use SMTP to send the message text along with an image attachment to the selected 
recipient (any e-mail address). 

[55] It should be noted that the e-mail interface is also utilized for sending images from an Ericsson 

Communicam to other mobile devices. Communicam images are posted from the camera to a dedicated 
server, which converts these images to an e-mail with image attachments. Proper configuration of the e- 
mail recipient address allows the user to send these images to other mobile devices. Communicam is a 
specific commercial line of cameras that can be attached to phones. It is referred here to a general 
camera attached to a phone. 

SMS Module 

[56] The MMR allows users to send messages in several SMS based formats. Picture Message for Nokia 
phones, and EMS messages for Ericsson phones are supported. 

[57] Incoming messages are transcoded into PM and EMS, dividing the original message into up to 6 SMS 
messages. The recipient's phone receives these SMS messages, and concatenates them. When all SMS 
messages have arrived, an application on the mobile device displays the message content. 

[58] It should be noted that MMR can also receive PM and EMS messages originating from mobile devices, 
and transfer these messages in different formats to other devices. This feature requires a special 
agreement between the SMS service provider and the MMR operator, in order to forward all 
concatenated SMS messages through the MMR. 
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[59] The conversion a full message (pre source with source objects) is a conversion where certain 
constraints and relations between the media objects requires more processing and application of 
"business rules": for example, if an EMS message which is 6 SMS long is sent to a Nokia phone (NSM 
messages are up to 3 SMS long) some or all of the following operations may take place: 

[60] a. The images get resized to reduce their size in bytes. 

[6 1] b. The Audio files get truncated to reduce their size in bytes. 

[62] c. The text formatting may be removed to reduce total message size. 
£— -MMR Logic 

[63] The MMR-Logic controls the behavior of the MMR. Using the MMR database, and set of configurable 
rules, the MMR Logic selects the most suitable message format for each recipient. It then determines 
the correct data flow path for each of the possible message transactions. The MMR-Logic is also 
responsible for the on-the-fly gathering of information about users and their mobile devices. This is 
performed by e.g. registering the WEB /WAP user-agent of the phone when it sends requests, or by 
identifying the type of the message sent from a phone (e.g. an EMS message) which indicates that this 
phone can send/receive message in this format. 
^=-MMR Database 

[64] The MMR database stores information about the system users, such as name, phone number, phone 
model etc. Message contents, i.e. images, audio and text, are also stored in the database. The MMR 
database also contains information required by the O&M block. 
O&M 

[65] The MMR O&M functions provide the MMR system adrninistrator with an array of tools to monitor 
and control the behavior of the MMR. A Web based interface, provides the administrator access to web 
pages, arranged in groups according to the functional blocks in the MMR. 

[66] The O&M also provides me administrator with a messaging page, which allows him to send messages 
in all formats to mobile devices. 

ft: — MPS Client 

[67] The MPS client translates the required transcoding action, as determined by the MMR-Logic block, 
into an XML request. This request is then posted to the MPS server rack. The MPS client then parses 
the response, and extracts the transcoded image. Further details are found in section IV J 
44v~MPS (Media Processing Server) 
[68] The media-processing server handles the message transcoding from one format to the other. Other 
image processing functions such as face detection can also be called via the XML interface. 
44^VASP RPC Module 

[69] The MMR provides an external interface to Value Added Service Providers (VASP), allowing remote 

invocation of MMR functions via an XML RPC interface. 
[70] Tthe XML RPC infrastructure can be easily expanded to include additional remote procedure calls. 

Details are found in sections IV H and the last section titled MPS control interface document. 
^-Internal WAP GW 

[71] The MMR hosts an internal WAP gateway. This gateway is required to support functionality not yet 
supported on commercially deployed gateways. The internal WAP gateway allows the MMR to 
send/receive MMS messages, as well as use WAP push for message notifications. The inclusion of an 
internal WAP gateway is optional, not a must. An external MMS compliant WAP gateway supporting 
segmentation and re-assembly (S AR) and MMS tags/mime types can be used. 
^-Internal SMS GW 

[72] The internal SMS GW is used due to special functionality required for receiving EMS and PM 
messages. The SMS gateway is an interface layer/mediator for receiving and sending the SMS 
messages from/to an SMS center (SMSC) via the prevailing protocols such as UCP,SMMP,CIMD etc. 
Z K C Details of Selected Functional Blocks 

■k— User Web Pages - The MMR Web Site 
[73] The public MMR main web portal contains links to at least the followingr functions: 
[74] Link to the user's personal web based "Messaging Application" 
[75] Link to the "Photo Album" 
[76] Link to the "User Registration Page" 

Web-based Messagine Application 
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[77] The web based messaging application provides similar functional capabilities to the wap based 
application. User's may enter their personal messaging page, by using the same user name and 
password as used on their mobile phones. Once inside, user's can view and send messages in a variety 
of formats. 

[78] From the web-based application, users can also send messages with original content. 

b^-Web-Based Photo Album 
[79] The MMR provides a web based photo album application, allowing the user to upload and manage 
their own folders containing images and audio files. These files can then be shared with friends, or sent 
to mobile devices in a variety of formats. 
[80] The MMR can automatically select the message format most suitable for the recipient, or may receive a 
request from the sender to send the message in a specific format. 

e^Web-BasedUser Registration Page 
[81] The user registration page allows new users to register themselves to the service. It also allows 

registered user's to update their registration information. Registration information requires the user to 
submit some personal details, as is accustomed in web based email services. In addition to this 
information, the user can be asked to submit information regarding the model of his mobile device. 
g~~The E-Mail Module 

[82] The MMR allows users to send E-mail messages with image attachments to mobile devices. It also 
allows mobile users to send E-Mails with image attachments to regular e-mail addresses. Incoming E- 
mails are parsed. The e-mail subject is sent as the message text, while each of the image attachments in 
the original e-mail are transcoded for the mobile device. Depending on the amount of attachments, the 
recipient may receive several messages, and a format most suitable to his mobile device. Outgoing e- 
mail messages use SMTP to send the message text along with an image attachment to the selected 
recipient (any e-mail address). 

gj— Email Server Account Set-Up 
[83] The e-mail server needs to be configured to receive all mails addressed to a selected domain , e.g. 
pics.ucngo.com . All incoming e-mail messages in this format are accepted by the e-mail server. 
Furthermore, the server is configured to create an event for each incoming message. This event triggers 
the MMR to handle the new message an described in the sections below. 
fe^— Email to Mobile Device 
[84] 1. Receive messages sent to : user-phone-number@pics.ucngo.com e.g. 

97254985026@pics.ucngo.com 
[85] 2. The e-mail is handled as follows: 

[86] i) The email subject is extracted, and used as the message text. (The message text is limited to 120 

characters (configurable)). 
[87] ii)The phone is extracted from the email address and used as the recipient's number. 
[88] iii) The Image attachments are used as the message images. Each attached image generates a new 
mobile message, with the same message text. The maximum allowed message size is 150Kbyte 
(configurable) 

[89] 3. The message is stored in the message table of the MMR database. 

[90] 4, The MMR-Logic selects the optimal message format for the recipient. 

[91] 5. A message in the selected format is sent to the recipient. 

Outgoing Error Messages 

[92] Incoming e-mail messages that can not be handled according to the logic above, will generate an error 
message. This error message will be sent to the e-mail originator, to notify him that his message could 
not be handled. For several expected cases, the exact error will be given, to explain why the message 
could not be handled: 

[93] i) Recipient number is not a valid number, or is unknown to the system. 

[94] ii) Message does not contain a valid image attachment. 

[95] iii) Message attachments are larger than the allowed quota. 

<£— Mobile Device to E-mail 

[96] Mobile devices may send e-mail messages via the WAP messaging portal. 

[97] During the process of sending a message, the WAP user is provided with a "send as" link, allowing him 

to select from a list of optional formats. 
[98] By selecting "send as e-mail" the user prompts the following chain of events: 
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[99] i) A new e-mail message is composed. 

[100] ii) The image original is taken from the message database, and sent as an e-mail attachment. 
[101] iii) The message text, as edited by the user is sent as the e-mail body. 

[102] iv) The e-mail subject is a generic message in the form of: "You have received a new mobile e- 

mail from <sender number>" 
[103] v) The recipients e-mail address can be entered in one of two ways: 

[104] 1 . If the e-mail recipient is a registered user, the sender may type in the recipient's phone number, 

and the MMR will lookup the recipient's e-mail address from the database. 
[105] 2. If the recipient is not registered, or if his e-mail address is not known, the sender will be directed 

to a wap page from which he can edit the required e-mail address. 
j?°»SMS based Messaging 
[106] The SMS-based module is in charge of generating at least the following message format: EMS, 

PM, WAP link notification (SMS or WAP-push), WAP-push MMS notification, Text SMS for 

devices that do not support images. 
[107] Furthermore, the SMS module includes an SMS link layer, capable of receiving EMS and PM 

messages from mobile devices. The link layer can then concatenate the fragmented SMS messages that 

make up an EMS or PM and extract the message image and text. These messages can then be 

transcoded into any of the supported formats. 
[108] The MAS core is a group of java servlets, that handle image transcoding management / message 

transfer / database functions / billing and O&M functions. 
[109] These servlets have external interfaces to an Email server , SMS center and WAP / WEB gateway 

allowing the MAS to interconnect between devices using these protocols. Refer to the : SMS / EMS / 

PM Module for a more detailed block diagram of the SMS / EMS / PM modules. Refer to the : MAS 

E-Mail Module for a more detailed block diagram of the POP3 / SMTP modules. 
[110] The PM/EMS/SMS receive will be handled by a dedicated servlet, it will interface all mcoming 

SMS's handled by the SMS GW. It will encode the incoming SMS's using the following top level logic: 
[111] 1) Detect type of message single, concatenated 
[112] 2) For Single Message: 
[1 13] a) Detect Type of message Text, PM, EMS 
[1 14] b) Extract Image or Text from message 

[115] c) Post data to with text & phone number to sms handler servlet 
[116] 

[117] 3) For Concatenated Messsage Store in local db with message ID 

[118] 4) When Last message received - (from analyzing XX,YY,NN : XX — msg id, YY - total number 

of msgs, NN - msg sequence in the UDH) and tracckin sequence of received messages, do: 
[119] a) Concatenate message data 
[120] b) Detect Type of message Text, PM, EMS 
[121] c) Extract Image or Text from message 

[122] d) Post data to with text & phone number to sms handler servlet 
4; — MMR-MM1 System Logic 

[123] The MMR Logic module determines the data flow path and transcoding type used on messages 
that go through the system. Sub-section 4(a)defines the chain of events that take place, for each of the 
possible combinations of input and output formats. However, there are cases where the recipients 
phone capabilities are either not fully known, or the recipient's phone may be able to accept messages 
in more than one format. 

[124] Subsection 4(b) deals with selecting the correct message type for the recipient.This sub-section 
deals with scenarios where either the sender or recipient's information is either not known to the 
system, or it conflicts with previous information stored in the MMR about the user. 
«^~Transcoding Matrix 

[125] The MMR enables messages to be sent from one device to the other, automatically transcoding the 

message content from the source device format to the target device format. 
[126] The supported formats are : WAP / WEB / E-Mail / PM / EMS / MMS / SMS. 
[127] The following sub-sections describe some of the various transcoding actions taken for each 

combination of source and destination formats. 
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[128] Image source data is already in the database. 

[129] Send notification to the recipient with a link to the new image. 

[130] Transcode the image when the recipient enters the message page according to the UA used. 

[131] Upload new image via HTTP to the database. 

[132] Send notification to the recipient with a link to the new image. 

[133] Transcode the image when the recipient enters the message page according to the UA used. 

^= WEB to Preview 

[134] Upload new image via HTTP to the database. [135] Transcode the image to the expected 

recipient phone type. 
[136] Transcode the image from the format above to the PC monitor format. 
[137] Display resulting image wrapped in some html. 
[138] Delete the image original from the database. 

^RM&iltoWAP 

[139] Extract images and recipient numbers. 

[140] For each recipient send all the images. 

[141] Send notification to the recipient with a link to the new image. 

[142] Transcode the image when the recipient enters the message page according to the UA used. 

[143] Take image original from the database. 

[144] Transcode to GIF or JPEG to a size no larger than 25Kbyte (Configurable) 

[145] Send to recipient's e-mail address. 

[146] Take image original from the database. 

[147] Transcode to PM / EMS / MMS according to recipient phone type. 
[148] Send message. 

^= WEE to PM ifM& I MMS 

[149] Upload image from web or photo-sharing site. 

[150] Transcode to PM / EMS / MMS according to recipient phone type. 

[151] Send message. 

^=E^Mi m pm // ems n mms 

[152] Extract images and recipient numbers. 
[153] For each recipient send all the images. 

[154] Transcode the image to PM / EMS / MMS according to recipient's phone type. 
[155] Send message. 

f^WAP / WEB / E-Mail to SMS 

[156] This mode will be used when the recipient's phone cannot display images. 
[157] The message text will be sent as an SMS to the recipient. 

fM^=lMS^PMt*>WAP* 
[158] Receive and store EMS/PM as fragmented SMS messages. 
[159] Link fragments to a complete EMS/PM 

[160] Send notification to the recipient with a link to the new image. 

[161] Transcode the image when the recipient enters the message page according to the UA used. 

ftt^=BMS f PM to EMS II PM * 
[162] Receive and store EMS/PM as fragmented SMS messages. 
[163] Link fragments to a complete EMS/PM 

[164] Create and send an EMS or PM according to the recipients known device capabilities. 

«=EMS/PMtoMMS* 
[165] Receive and store EMS/PM as fragmented SMS messages. 
[166] Link fragments to a complete EMS/PM 
[167] Transcode EMS/PM into MMS format. 
[168] Initiate an MMS transaction with the WAP Gateway. 

(13) A ll to SMS 
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[169] Strip images from the original content. 

[170] Send the first 160 characters of the text message as an SMS message. 

(14) MMS to all fottmte 

[171] Receive the MMS message. Extract the content files and the SMIL. 
[172] Insert the all image and audio files into the database. 

[173] Enter the SMIL file into the database. Transcode the SMIL informatin into HTML/WML/EMS 
formatting information if the targets are WEB,WAP,EMS respectively. If the target is email or an 
MMS phone that does not support SMIL, the media objects (MIME objects) may be reordered based 
on the information in the SMIL description to ensure proper viewing order between the various media 
objects. 

[174] Send the message to the selected output format, as if it came from the WEB. * - Requires 

redirection of concatenated SMS messages by the SMSC through the MMR. The MMR then handles 
EMS/PM messages according to the above logic. 

k^=~Sender and Recipient Logic 

[175] In order to properly perform a message transaction, full knowledge is required about the sender 
and the recipient. This is required to allow for optimal transcoding of the input message to the correct 
output format. However, there are cases where either the sender or the recipient (or both), are "not 
completely" known to the MMR. The data stored in the MMR database may be incomplete or 
inaccurate. For example, a user might be registered to the service, without the MMR knowing which 
device is being used. This information may also change when the user moves his SIM card from one 
device to another. In other cases, the user might not be registered in the database at all. The purpose of 
this module is to perform the correct logic decisions, to make the most out of the data that is known to 
the system. Furthermore, the sender and recipient logic are used to gather information about the system 
users in an un-formal way, by correlating information such as phone numbers, device user agent, and 
incoming message formats. This information is added to the information submitted by the user, during 
the registration to the service (which is not mandatory, but recommended). Further details are included 
in Section J. 

e^— Qn-The-Flv Data Collection 

/ EMS / MMS Capability (PM .EMS input 
Disabled) 

[176] When an unregistered or a partially known user sends a PM , EMS , or MMS message to an MMR 
recipient, the MMR can register the sender on the fly. The purpose of this action is to update the 
database, and add users on the fly. If the user was already registered, the MMR checks that the user's 
capability to send messages in this format is already known. 

^Ujp^itiag the Ufeafe WAP Jfavfite 

[177] When a registered user sends a WAP message to an unregistered recipient, this recipient receives 
an SMS notification. When the recipient enters the message page, his phone's User- Agent becomes 
known to the system. At this point the MMR can add the recipient as a new user, and assign the correct 
device to him. This function is also useful for registered users who are now using a new phone model. 
The database can be updated with the new user agent. Accordingly, other capability flags, such a PM 
and EMS might now change. The exact same description also applies for phones using HTTP (standard 
WEB) browsers - there too the browser specifies a User Agent. 

f^SymhsQtiidfig the User data aad devtee data 

[178] At any given moment, the MMR database might hold information about the user and his mobile 
device. Since some of the message formats may operate by using the user's capability flags alone, some 
users may not have a registered device type for extended periods. When user's register themselves 
through a dedicated registration process, or when users enter a WAP session their device becomes 
known. At this point it is important to verify that there is no discrepancy between the user's capability 
flags, and the devices' capability flags. The synchronizing process forces the devices' capabilities on the 
user. 

4^==Selected Message Type Logic 
[179] This section explains the logic implemented in the MMR Logic Module, to select the correct 
message type for the recipient. The logic is divided into a case where the recipient is registered (at 
least with partial data), or when the recipient is unknown to the MMR. 
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[180] For each transaction, this logic should bring the system to the following state: 

[181] There is a valid sender data structure. 

[182] There is a valid sender device data structure. 

[1 83] There is a valid recipient data structure. 

[1 84] There is a valid recipient device data structure. 

[185] The format of the incoming message is well defined. 

[186] The format of the required output message is well defined. 

[1 87] The flow of events required to carry out this transaction is well defined as discussed above in 4(a). 

ffc^Sd^tfcd Message Typfe for a. r^st^d recipteftt 

[188] The selected message type for the recipient for any transaction is effected by the following 
parameters: 

[189] A direct request by the sender for a specific message type. 
[190] The user's device capability flags. 
[191] The user's capability flags, (if the device is not known) 
[192] The user's preferred message type. 

[193] The device's preferred message type (if the user did not specify one) 

[194] The original message format. 

[195] 

[196] The capability flags show if the user can accept message in the following formats: 

[197] MMS / LEMS / SEMS / PM / WAP Flag. 

[198] 

[199] Due to the information required before a decision can be made, the recipient's data must be known. 
The receiver data may either be known because it was stored in the database, or because it was 
temporarily created for this transaction, as explained in section 4(b). In any case, at this point there can 
no longer be a discrepancy between the user's capability flags and his device's capability flags. 

[200] Given that the recipient is known, the selected message type will be chosen according to the 
following logic. 

[201] If the sender requested a specific format, that format is selected. (Forcing the format by the sender 
may result in the message not being sent. This is not the normal mode of operation. In the normal 
mode, the sender selects "automatic" and the MMR decides the best format automatically.) 

[202] If the sender mode is automatic, the user's "preferred message type" is compared to the devices / 
user's capability flag. If it is a legal selection, the message is sent to the user in his preferred format. 

[203] If the user has no specific preference, and user's device data is the next dominant information 
according to the following logic: 

[204] o If one of the optional formats of the device allows the message to be sent without being 
transcoded, that format is selected. 

[205] o If the message must be transcoded, and the device has a "preferred format", that format is 
chosen. 

[206] o If the device data doesn't specify a "preferred format", the best of the format options is selected 

according to the following order: MMS, WAP, EMS, PM. 
[207] o If the user's device is not known, the user's data is the next dominant information according to the 

following logic: 

[208] o If one of the optional formats acceptable by the user allows the message to be sent without being 

transcoded, that format is selected. 
[209] o If the message must be transcoded, the best of the format options is selected according to the 

following order: MMS, WAP, EMS, PM. 

^=Sd^fed Message Typs for <m xm^st®t®l retipteat 

[210] If the sender requested a specific format, that format is selected. (Forcing the format by the sender 

may result in the message not being sent.) 
[211] If the sender did not request a specific format, the message will be sent according to a default 

table. The default table will be able to select a default output format for each input format. This table 

will be configurable with a simple editor. Table 1 is an example of the default output format table. 
[212] A temporary recipient data structure is created with coherent information to allow the selected 

transaction to take place. 
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[213] 

[214] Input Format WAP WEB E-Mail PM EMS MMS 

[215] Default Output Format WAP WAP WAP PM EMS MMS 
[216] Table 1 : Default output format table. 

^— Transmitting pre-source information to server 

[217] The pre-source information is transmitted to the server. The following 
description along with the figures 9-11 referred to herein provides, to a 
skilled artisan, further explanation on the transmission of pre-source 
information. 

a^=-User Control Web Page 

[218] This is a site that will allow users to change toe attributes the system 

holds about their phone. Authentication will be done as follows: 
[219] User enters a page. 
[220] Enters his phone number. 

[221] System sends an SMS with a 4 digit code to the user while he Is 
surfing. 

[222] User will be able to change his phone type, EMS capabilities etc. 
[223] User pressed submit. 

[224] System prompts the user to get toe secret code from toe SMS in box. 
[225] User will enter the code and his registration details will be changed. 

feWJSP Pages 

[226] The following sections are pseudo-code descriptions of toe JSP pages. 

[227] Enter your Phone number 
[228] Enter your Password 

[229] Struct userstruct = GetUserStmetByPhoneNumbeKnumber) 
[230] Bool isvalid « AuthenticateUser(number, userstruct.password) 
[231] If (isvalid) Goto Main.jsp (2.3) else if (Try again ?) goto login.jsp 
[232] If ("Password* \ \ "New user") Goto Forgotomew.jsp 

[233] Enter phone number 

[234] Bool newuserok = adduser(number, user-agent) 

[235] If (newuserok) Bool passwrdsetok « SetNewPassword(number) 

[236] If (passwrdsetok) userstruct « 

GetUserStructByPhoneNumber(number) 
[237] Bool smssent * SendSMS(userstruct.password , number) 

[238] Bool gotmsgvect « GetUserMessageIdVector(const number, 
msgidvect) 

[239] If (gotmsgvect) numofmessages <= msgidvect.sfee 

[240] Set number in the brackets (i.e. Messages ) 

[241] Bool gotarehivevect = GetArchiveNameVect(archivevect) 

[242] Int numofarchives - archivevect.stee 

[243] Set number in the brackets (i.e Archives ) 

[244] If "Messages"* goto messages^jsp 

[245] If "Archives* goto archives.jsp 

[246] for (I~Q 1 1 « numofmessages) CreatoUnkToMsg(msgidvect[I]) 
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[247] if "Message* pressed, goto msg.jsp (2.5) 

[248J if "Delete AH" pressed Bool deleteOK - DeleteAIIMessages(msgidvect) 
[249] if (deleteOK) goto MessagesDeleted.jsp, and then goto main.jsp 

[250] msgStruct = GetMessageStruet(msgidvect[I]) (including 

GetlmageFilelD) 
[251] String msgtxt ■ msgStruet.txt 

[252] Vector < chars* image^buffer = msgStruct.GetlmageBuffert ) 
[253] DevStruct SourceDeviceStruct « 

GetUserDeviceByNumber(msgStruct.Prom) 
[254] Storing UA =GetUserAgentStringFromSession( ) 
[255] DeviceStruet TargetDeviceStruct = GetUserDevieeByUserAgent(UA) 
[256] XMLreq « GetXMLrequest * (SourceDeviee, TargetDevice , 

iipn&Q^ buffer*) 
[257] XMLresponse « PostXMLrequest( XMLreq ) 
[258] Outjmagejauffer ■ GetlmageFromXML (XMLresp) 
[259] Output message.wml 
[260] If "Send" goto send jsp 
[261] If "Delete Message* Bool msgdeleteOK * 

DeleteMessage(msgStruct,msgID) 
[262] If (msgdeleteOK) goto MessageDeleted„jsp, and then goto 

messageSsjjsp 

[263] Edit text edit box, 
[264] Edit number edit box. 

[265] If "Send as* pressed goto #sendas card* Select message type, 
[266] MsgStruct ■ CreateMessageStruct(From t To , type * text „ 
imageFilelD) 

[267] If "Send* Bool sentOK « SendMessage(msgstruct) 
[268] If sentOK goto sentok.wml and then goto messagesjsp 
[269] If ((IsentOK) && (type-wapwap) ) goto sendfailedjsp.jsp 
[270] If ((IsentOK) (type-wapemail) ) goto noemailaddressjsp 
[271] If ((IsentOK) && (type=wapemspm) ) goto notemsablcjsp 

[272] if "Enter Address" goto Enteremaikjsp 

(^Mntererraiijsp 
[273] MsgStruet,emailaddress = emailaddress, 
[274] sentOK — SendMessage(msgstruct) 
[275] if "fix address* goto entermail.wml 

[276] for (1-0 ; I * numofarchives) CreateLinkT©Archive(archivevect [I]) 
[277] if "Archive* pressed, goto archive, jsp (2.10) 

(10) Archivejsp 

[278] Bool GotVeetor- GetArcMessageIDVector(ArchiveID , msglDVect) 
[279] for (1=0 ; I ~ numofmessages) CreateLinkToMsg(msgidvect[I]) 
[280] if "Message" pressed, goto msg,jsp 

[281] if "Delete All* pressed Bool deleteOK « DeleteAIIMessages(msgidvect) 
[282] if (deleteOK) goto MessagesDeleted.jsp, and then goto main.jsp 
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e»— MAS-MPS client 

[283] The MPS client block enables MAS servlets to use MPS transcoding 
services, as well as supplying an API for XML and Base64 functions. Listed 
below are the main functionalities of this block, 

[284] Encoding and Decoding of images from binary to baseS4 aseii, 

[285] Creating XML transcode request 

[286] Posting XML requests to an MPS server rack. 

[287] Receiving and analyzing XML responses. 

[288] Handling MPS errors, 

IV.D. D etails of Media Processor 



[289] The MEDIA Processor provides image processing and transcoding for 
purposes of image enhancement and terminal compatibility, Na\ ve 
transcoding may result in unreadable content on the small screen of a 
mobile terminal. The Media Processor enhances the image to correct such 
faults when the content type is identified, the MPS also supports audio, 
ringtones, animation, video see for example AudioTranscode, 

[290] Communication with the Media processor is implemented using XML 
interface. The Media Processor reports success or failure for an entire 
message as well as for each individual operation ©f the message. The 
media processor supports processing multiple images within a single 
message, 

[291] At least the following media processing functions are available to 

render message images for display on user's device: 
[292] Adaptation functions- media format convert- from (Progressive JPEG, 

Baseline JPEG, JPEG 2000, GIF87, GIF 89A, WBMP, BMP, PNG, EMS, Nokia 

PM) to (Progressive JPEG, Baseline JPEG, JPEG 2000, GIF87, GIF 89A, 

WBMP, BMP, PNG, EMS, Nokia PM) including colour palette adaptation, all 

based on a client submitted device type parameter, 
[293] Image content selections are provided to identify the type of image 

(e,g, - Photograph, Face, Document (e,g. FAX), cartoon, Synthetic (e,g, 

chart), Panoramic (e,g, scenery). 
[294] The MEDIA Processor includes a facility to smart compress images 

(VGA picture with smart JPEG compression takes maximum storage of 

approximately 50k), 
[295] The Media Processor is capable of being shared by multiple clients, 

4t— Enhancement functions 

[296] The media processor provides the following media processing image 
enhancement functions: 

[297] Brighten (dark), Darken (overexposed), Enhance, Colour balance, 
Remove Noise, threshold(local adaptive, standard), adjust levels, sharpen 
(radius, intensity, automatic), de-blur, smooth, histogram equalise, invert, 
flip(mirror), erop(arbitrary or parametric). Remove artefacts, 
resize( nearest neighbour, bi-cubic, bilinear, maximum/minimum 
neighbour, line preserving), salt and pepper removal, local illumination 
correction (arbitrary, emphasise edges), histogram equalisation, 
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histogram manipulation, Brightness, Contrast, Colour modification, Rotate 
(90, 180 or 270 degrees). 

— Auto-Enhancement functions 

[298] The media processor provides the following media processing auto- 
enhancement functions: 

[299] Auto level, auto crop, auto colour balance, auto image content type 
detection, Image classification and Optimisation Processing, Add text as 
graphic, add background, image manipulation (warping), image framing, 
combine images, Add objects (hair, eyeglasses etc,), Include image in 
postcard / template. Camera calibration for common mobile camera 
types, 

9-. — Imaee Stitching 

[300] The media processor provides the following image Stitching; stitch 
36Q~degree panoramic and stitch fax. Full stitching 2 images / arbitrary- 
length series, Image pair matching, Image merging, given shift 
parameters, Image stitch/match given assumptions (e,g, horizontal only), 
stitch (Brightness, Contrast, Colour), 

4-. — Advanced Functions 

[301] The media processor provides the following media processing 

advanced functions: 
[302] Detect face? detect eyes, OCR Recognition, Bar code Recognition, 

picture object recognition, Image recognition (e,g, content type 

recognition to permit optimal transcoding), 

— -Watermarking 

[303] Watermark detect and add functions shall be provided for WBMP and 
JPEG images, A watermark shall support a minimum of 19 decimal digits, 
[304] 

IV.E. I dentifying display characteristics 

[305] The following code segment explains display characteristics 

identification, 
[306] - <target-deviee> 
[307] - <platform» 

[308] <manufacturer>Ericsson</manufacturer> 

[309] <modet>R320«/model> 

[310] <ROM-revision>n/a</ROM-revision> 

[311] <User-Agent>EricssonR320/RlA</User-Agent> 

[312] </platform> 

[313] - <network-connection> 

[314] <nc-type>GSM/CSD</nc-type> 

[315] <nc-speed>9600</nc-speed> 

[316] </network-connection> 

[317] - <target-display> 

[318] <horteontal>88«:/horteontal> 

[319] <hori2ontahscroH>88</hori2ontal-scroH> 

[320] <vertical>52</vertical> 

[321] < vertical-scroll >1 lG«/vertical-scroll > 

[322] <dpi>n/a</dpi> 

[323] <pixel-ratio> l,24</pixel-ratio> 
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[324] - <eolors> 

[325] <type>B/W</type» 

[326] <number>2«/number> 

[327] <bit-arrangement>rtfa</bit~arrangement> 

[328] <palette-LUT»r»/a</palette-LUT> 

[329] <gammi /> 

[330] <brightness j> 

[331] </colors> 

[332] «emstype»none</emstype> 

[333] <pdu>13Q0«/pdu> 

[334] <maxwpdksize /> 

[335] <maxpixels /> 

[336] <Aarget-display> 

[337] «/target-deviee> 

[338] 

IV. F, Additional Processing by Media Processor. 

[339] The Media Processing Server (MPS) is designed to handle all media 
types, including formatted text, images, animations, audio and video, 
with an emphasis on advanced processing algorithms. In a nutshell, some 
of the following funeitonalities are provided; 

[340] 1, Image Transcode - Optimally convert content for a target phone. 
Automatically performs resizing, color palette reduction, compression, 
rotation, watermark detection and more. The transcode operation is 
controlled by a rule based system with configurable parameters for 
bandwidth utilization, format usage, Quality of Service and content 
preferences. Performs different transcoding operations based on 
automatic detection of the content type, 

[341] 2, Audio Transcode - Similar to transcode for audio files. Useful for 
converting audio found on the Internet to MMS phones. Also supports 
conversion of rtngtones between the different formats existing today, 

[342] 3, Video Transcode - similar to image transcode for video files. Also 
supports cross media conversion - video to animation, video to still image, 
video to sound track, 

[343] 4, Image manipulation package: 

[344] 4,1, Rotate - rotates an image by a specified amount with a selection 
of interpolation methods, 

[345] 4,2, Resize - resizes an image with several interpolation methods 
including special modes for phone screen with a small number of 
colors/non-square pixels 

[346] 4,3, Brighten - enhances the image brightness - useful for dark images 
and for adapting to phones with a nonlinear Gamma curve, 

[347] 4,4, Darken - decreases the image brightness - useful for over- 
exposed images and for adapting to phones with a nonlinear Gamma 
curve, 

[348] 4,5, Enhance » combines color and contrast enhancement of an image. 
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[349] 4.6, Color balance - performs color balancing of Images taken by low 

quality cameras or in difficult lighting conditions* 
[350] 4*7* DenoiseSpeckle - noise removal for low-light/noisy camera/data 

trasnsmission errors situations* 
[351] 4*8* Threshold - blnarization of images for B/W screens* 
[352] 4*9* Adjust levels - parametric contrast adaptation/ 
[353] 4*10* Sharpen - fast parametric correction of blurry images* 
[354] 4*11* Deblur - special sharpening for camera images taken in low light 

conditions* 

[355] 4*12* Smooth - smooths a noisy image* 

[356] 4*13* Histogram equalize - automatic contrast range enhancement* 
[357] 4*14* Invert - performs a color/grayscale inversion* Useful for certain 

synthetic images on low contrast phone screens* 
[358] 4*15* Flip - fast mirroring operation* 
[359] 4*16* Crop - cut a part of the image* 

[360] 4*17* ArtifactRemove - JPEG artifact removal* Useful for highly 

compressed JPEG Images (e*g* those transmitted over wireless links)* 
[361] 4*18* DenolseSaP - Salt and Pepper noise removal* 
[362] 4*19* LoclllumCorrect - Correction of lighting non-uniformity. Useful for 

images of printed text* 
[363] 4*20* PremHistEq - advanced histogram equalization for images with 

dynamic range problems* 
[364] 4*21* ColorPaletteAdapt - Reduce the number of colors In an Image 

using a fast algorithm* Useful for image file size reduction/adaptation to 

phone screens with a small number of colors* 
[365] 4*22* FaceDeteet- automatically detects a human face in a frontal 

facial image* Useful for capturing the most important part of an image for 

display on a limited size screen* 
[366] 4*23* EyeDetect - automatically detects the eyes*nose section of a 

frontal facial Image* Useful for capturing the most important part of an 

image for display on a limited size screen (e,g* Nokia Picture Message)* 
[367] 4*24* Add Text - Add formatted text to an image (with font selection)* 
[368] 4*25, Add Object - Add an object (hat, eyglasess etc*) to a photo* 
[369] 4*26* Add Frame - Add a frame (several selections) to a photo, 
[370] 4*27, Add Effect - artistic effects (warp, sphere, twirl etc,), 
[371] 4,28* Embed Watermark - embed a watermark in an 

image/audio/video file, 
[372] 4*29, Detect Watermark - fast detection of an existing watermark in an 

image/audio/video file* 
[373] 4,30* Smart Compress ~ reduce the file size of the image/audio/video 

file to below a specified limit* Useful for reducing network bandwidth and 

for overcoming memory limitation in handsets* 
[374] The MPS supports In a single product the complete range of processing 

requirements for the full spectrum of future MMSC infrastructure users* 
[375] 1* The phone MMS user, composing and sending an MMS from a 

phone* In this scenario the primary need is for fast transcoding and 

automatic content type identification and processing* For example, images 
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taken by a user with a camera-phone need JPEG artifact removal, 
automatic contrast and color enhancement and face/eye detection for 
maximum utilisation of target display screen size. 

[376] 2. The Internet MMS user, composing advanced MMS messages from 
an internet-based interactive Multimedia Album, This user can play around 
with images and audio/video objects, add text/objects/frames to image, 
compose and use existing SMIL templates etc. Relevant functionality in 
the MPS includes support for interactive manipulation of images (adjust 
contrast, add formatted text, add a hat to a person in the image etc,), 
efficient storage of images (smart compress), 

[37?] 3, Advanced MMC scenarios, where a sequence of processing 

operations is performed on an MMS prior to sending - for example, detect 
watermark, block/report to billing system based on watermark info, 
compress audio component to reduce total MMS size while maintaining 
overall quality, convert video sequence to animations etc, 

[378] 4, Content providers - these providers have large amounts of content 
with specific, detailed processing sequences based on their 
preferences/knowledge of the content characteristics. Such providers will 
utilize the more advanced options of functions such as Transcode, 
compress, color palette adaptation, embed watermark etc. 

4T=Transcoding 

[379] The main functionality of Transcode is to convert an image so it will fit 
into a target device while maintaining the best quality possible. In order to 
fit an image to a specific device, the main considerations are; 

[380] 1, Resizing the image until it is small enough (in pixels) to fit the 
device. 

[381] 2. Reducing the image's color/bit depths to the device capabilities, 
[382] 3, Converting the image to the specified format - typically this format 

should be supported by the target device, 
[383] 4. Ensuring that the resulting file size does not exceed the memory 

limitations of the device 
[384] 

[385] The algorithm used by Transcode can be divided into three main 

stages, according to the above criteria. 
[386] 

«ri — Stage I: Resize-Fit 

[387] 

[388] In this stage the image is resized to fit the target device. For better 
quality, other image attributes (like bit depth) are not reduced yet 
(actually they may even be enhanced), Different variant of the resizing 
algorithm are used for different contentType values. Some parameters 
that may influence the result of this stage are; 

[389] Device dimensions, scroll-size, maximal allowed pixels, etc. 

[390] Source and target aspect ratio of pixels 

[391] The choice whether to use just the physical screen or the full scrollable 
screen - this is controlled by a configuration parameter, but overriding it 
in the XML-request is possible (useScroll). 
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[392] The option to rotate the image by 90 degrees in order to get a larger 
view (rotatetobest) - the default value allows rotation for small-screen 
devices only (cell phones vs. PDAs, PCs). This may be changed in the 
configuration or the request itself. 

Stage II: Color Fit 

[393] 

[394] At this stage the imagers bit depth and color space (i.e. color to gray) 
may be reduced in order to best fit a device. (For example, a color image 
with 24 bits of data per pixel may be reduced to a grayscale image with 2 
bits of data per pixel in order to fit a screen that has only 4 gray levels). 

[395] The image is run through a series of specially designed filters that 
maintain maximal image quality while reducing the bit depth, 

[396] Specifying the contentType of die image can also control the behavior 
at this stage. For example, a lineart-type image is treated differently here, 
with filters that are designed to preserve as much detail as possible of 
lines and shapes, as opposed to a face/object image, in which the 
processing involves sharpening of facial features, or "scenery* photo-type 
image, in which the main point is to preserve color and brightness 
accuracy as much as possible. 

e^=-Staee III; Creating the output file 

[39?] When die image has reached its final size and depth, it must be 
converted to die format requested (after making sure it is supported by 
the target device). This stage could have been straightforward, but we 
must also make sure the file is small enough for the device's memory to 
handle. In some cases, after the file is created, it may be necessary to 
repeat the previous stages and create an even smaller image, until the file 
size itself is small enough. 

— Other stages; 

[398] Stage 0 in contentType = "document* consists of local thresholding. 
[399] Reiteration of die process with stricter limitations if the output file size 
is too large, 

2. W atermarking 

[400] Watermarking (WM) consists of embedding hidden information within 
media files/objects, which may be used as part of a digital rights 
management system (DPM) - for billing, copyright, content-blocking etc. 
The information content of the watermark in MPS is defined as a 19-diglt 
numeric string, excluding 0 (i.e. l«=tWM<1019). 

[401] Currently watermarking is supported for the formats jpeg, gif and png 
and is performed by hidden comments - for jpeg and png; these 
comments won*t be visible through typical viewers. But it can also include 
watermarking of B/W at the image level, regardless of the format. The 
typical scenario for watermark usage is through devices that do not 
normally manipulate images, but may send images previously received 
from an MPS system without tracking information. 

«4— -Watermark functions: 

[402] EmbedWatermark - This function is used to embed the watermark 
(numeric string). It can be used only when die specified output format is 
one the supported WM formats. 
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[403] DetectWatermark ^This function detects the MPS watermark 

embedded in an image / media file* It is relevant only for WM~supported 
input formats. Note: The output of this function differs from typical MPS 
output - it is the watermark (or 'watermark not detected' message) and 
not an image, 

[404] RemoveWaterrnark - This function Is used to dear the watermark 
from an image, Note: This operation may happen as a side effect of most 
methods for some formats /implementations, 

[405] It should be noted What currently the watermark functionality is 
designed mostly for demonstration and evaluation purposes. When 
integrated as part of a well-defined DRM system* watermarking 
functionality may include: 

[406] Method X + PreserveWatermark: To maintain the identification of an 
image after transcoding / basic manipulation* an alias of the following 
combination may be used - DetectWatermark ~>wm; method-X; 
EmbedWatermark (wm), 

[407] Method X + ManipulateWatermark: Another possibility is that the 
output-watermark will have a different value than the input-watermark, 
either by applying some mathematical function to it* or by the some DRM 
component that will issue a new value and maintain a log of the 
relationship between these values, 

3^ — Overview of Image Processing Algorithms 

[408] The system introduces a large number of image processing algorithms 

designed for: 
[409] 1, Image adaptation and manipulation 
[410] 2, Illumination correction 
[411] 3, Noise reduction 
[412] 4, Sharpening 

[413] This grouping of methods is far the survey convenience only. The 
methods are simple enough to allow good definition of the parameters 
involved. Each method deals with common problems* relevant to image 
processing implementations, Still, the full collection of these methods does 
not allow dealing with complex problems, which are addressed by 
transcoding, premium and advanced packages. In complex scenarios it 
may be difficult to choose appropriate methods, for correcting the 
problem without introducing undesired side effects, which may degrade 
image quality to an undesired level, 

4*? — Common features 

Color treatment 

[414] In the image manipulations, denoising and sharpening functions the 
colors are treated independently, This means that each method is well 
defined for grayscale images. Thus it is applied with the same parameters 
3 times, once for every RGB channel, There is no essential difference in 
handling the indexed Images vs, the continuous color/grayscale Images, 
Such treatment of color channels is simple and intuitive, It allows better 
understanding and description of the parameters involved. More elaborate 
color space treatment shall be implemented in the context of premium 
package scenarios, 
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[415] In the Illumination correction functions there are usually 2 modes of 
Implementation with separateColors parameter acting as selector. When 
set separateColors-false, the method would handle all channels in a 
combined manner, 

fe^-»Smoothing kernels 

[416] Some algorithms included (noise reduction, sharpening etc.) use linear 
convolution with a pre-defined kernel as the main processing tool. The 
most common convolution is convolution with a simple Gaussian kernel. 
However, using convolution kernels with other shapes might improve the 
performance of the algorithms, 

[417] The algorithms which are not very sensitive to the kernel parameters 
use Gaussian kernel with standard deviation automatically calculated from 
the effective radius, 

[418] The algorithms which are sensitive to the kernel parameters use one of 
the following shapes: 

[419] 1, rect - Rectangular shape is the most common, since it allows very 
fast computation. The problem with this kernel is its emphasis on diagonal 
edges, which are seldom present In the image, 

[420] 2, diamond - This is a rectangular shape rotated by 45 degrees. The 
best feature of this shape is its ability to emphasize horizontal and vertical 
edges of man-made structures and geometrical objects. The other reason 
for emphasis on horizontal lines is the fact that they are hardly Influenced 
by discretization process, 

[421] 3, ellipse - Circular, or more generally elliptic kernel treats in the same 
way structures of every orientation. This is a more general-purpose 
kernel, used with natural images, 

[422] 4, softEHipse - In ellipse the edges are hard-threshold; either 1 or 0, 
In softEllipse the edges can have a value between 0 and 1, This allows a 
better approximation of disc shape. This feature is suited for linear 
operations and may cause artifacts with non-linear filters (median, 
contrast stretch etc) 

[423] 5, gauss ^This stands for Gaussian filter, e,g, *gaus0707Vgaus0505* 
and *gausAuto*, The later two indexes stand for the standard deviation 
value of the Gaussian in each direction. The recommended setting 
*gausAuto* automatically calculates sigma based on the radius of effective 
coverage of the Gaussian, The Gaussian kernel allows graceful 
degradation of the pixel weight far from the center of tine smoothing 
kemel. This feature is ideal for linear convolution, 

[424] 6, kX - E,g, *kl* and *k2*. Reference to bank of pre-defined kernels to 
be used in special scenarios. This is a 'premium* interface and there is no 
meaning to kernelWldth and kernelHeight, 

[425] It is possible to use prolonged kernels to emphasize the horizontal and 
vertical edges. For this purpose separate parameters are defined for 
kernelWidth and kernelHeight rather than radius, 

[426] Fine-tuning kernels for images and algorithms can be a tedious task, 
therefore some basic recommendations are given where possible, 

— Image adaptation and manipulation 
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[427] The elementary image processing operations deal with image size and 
orientation. These functions, namely Rotate, Resize, Rip and Crop, are 
available in every basic image manipulation package. Threshold, 
Compress (with GIF output) and Invert methods are used to adapt the 
image to the display color palette and minimize the use of target device 
memory space. Resize and Threshold provide advanced modes - which 
perform more sophisticated processing, 

[428] The methods in the basic image manipulation package can be 
optimized for speed, and can include platform specific speed-ups in all 
platforms (Intel, Solaris, etc), 

[429] In addition to providing an image manipulation package, these 
functions enable advanced compression and building your "home-made 
transeode*. Typical variations of such experimentations with transcoding 
consist of a sequence such as erop->resize->eompress, or crop-> 
(optional) rotate => resizes quantization/threshold - to adapt large 
images to small displays. Crop allows spending all of the limited screen 
resources on the main objectj resize minimizes the information contained 
in the picture and compression/quantization discards less-important 
information. Rotation is used to use the display dimensions and aspect 
ratio as best as possible. On some media the image has to be inversed 
prior to display (e,g, scanned negatives), 

a^— -Color palette adaptation 

[430] ColorPaletteAdapt fits the image to a limited palette. This is useful 
either when the device or file-format has a limited color capability or when 
file size is an issue. Once the palette is defined, each pixel is assigned a 
value from it; this is done either by assigning each pixel the nearest 
value, or dithering - a method which increases color resolution at expense 
of spatial resolution. As default, dither is used when the specified number 
of output colors is small, but the user may explicitly specify whether 
dithering should be used. Dithering is not recommended when output file 
size is a major issue, but is recommended when the device color capability 
is the issue. Note that ColorPaletteAdapt with paletteName=*B/W* is 
equivalent to threshold with the default parameter, for B/W adaptation 
one may prefer using advanced modes of Threshold, 

b4— Threshold 

[431] Threshold converts a grayscale image into a discrete B/W image. It 
may used as part of other more complex conversion operations (e,g, 
Transcode), and can serve for artistic effects or image combination 
effects. For example: converting a formatted text/textured text image into 
B/W before sending to a B/W screen, reducing the color content of a 
single layer (e,g, object for combining in an image) so it will not add to 
the color palette of an image etc. It allows explicitly controlling the 
threshold level, automatically finding the optimal global threshold, or 
applying a local threshold (mode localV2 is usually inferior to local). 

Compress 

[432] The method compress attempts to reduce file size without changing 
the image size. It may achieve this goal by more efficient coding, reducing 
colors and losing some details. In some case it may follow an operation 
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Intended to reduce the image size. It Is wise to apply compress in 
combination with an efficient image format (he, jpeg / jpeg2000 for 
storage, gif for most devices) 

[433] This function has a different implementation according to the 
requested output type. The problem of file size is important in several 
aspects: Most of the currently available phones / hand-held devices have 
limited capacity, both for single file and in total. Bandwidth in cellular 
networks may also be an issue. In addition, in messaging system storing 
many millions of MMS messages, the typical file size becomes an issue to 
consider too. On toe other hand most images entering the system may 
not be limited to the few-Kbytes range suitable for hand-held devices, 

[434] The palette and processing power limitations of currently available 
hand-held devices makes the compression utility especially relevant for 
image/gif output mime type, 

[435] In this case compress activates adaptive quantization procedure, which 
provides for a clear image with minimally reduced color palette. Detail 
reduction, image resizing and cropping are not supported by the compress 
method and require dedicated requests, 

[436] 

[437] The implementation of toe algorithm is based on adaptive reduction of 
the color palette and smoothing for GIF/PNG images, and on JPEG DCT 
quantization table variation and smoothing for JPEG images. The 
parameters are changed iteratively until the maximum quality setting with 
a file size under toe limit is reached, 

[438] It is important to know that if the target file size is too small, the 
algorithm will return an error message. This reflects the fact that even 
under a color reduction to a bitonal image the file size under the given 
compression method was too large. In this case the image should first be 
resized, then compressed, 

[439] 

[440] The parameter available for this function is maxSize, which is the 
maximal allowed image size in bytes. For a CIF sized image, to be 
displayed on a typical hand-held device, some recommended sizes are: 

[441] 

[442] Image /Device type MaxSize (bytes) 
[443] VGA image - storage Up-to 50000 
[444] Uneart/MMS message 4000 
[445] Uneart/WAP phone 1400-2000 
[446] T68/WAP client 2300 

[447] Natural image/PDA with MMS/email client 8000 
[448] 

4^=Invert 

[449] Some image sources (e.g. scanned negatives) and output devices 
require image inversion. The inversion is performed for each color channel 
separately* so that yellow is transformed to blue, white to black etc, This 
is a simple function so it does not require any parameters. It is most 
useful for e,g. synthetic images displayed on low contrast screens, 

c) R otate 
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[450] This is a standard implementation of image rotation. The parameter 
specifying the amount of rotation counter-clockwise in degrees is 
mandatory. Values out of 0-360 range are corrected by the algorithms* so 
that -90* 270 and 630 degrees rotation have the same effect, 

[451] For angles other than 0,90,180*270 the output pixels of the original 
image are not a rectangular image* this poses a choice of what 
rectangular image should be returned and what should be the values of 
the pixels not present in the input image (in 90*270 the size changes - 
width <~> height - but there is no dilemma defining the output image). 
The parameter mode determines the output size: mode^full returns the 
bounding block image of the rotated input image* mode=crop crops the 
rotated image to the size of the original image (with the same center), 
The portion of the output image* not present in the input is always padded 
with a black background. For rotation by multiples of 90deg both modes 
provide the same output, 

[452] The interpolate parameter allows to choose the interpolation 
technique, interpolate^bilinear is usually a good choice. It is 
computationally efficient and does not introduce large artifacts. Selecting 
interpolate=bieubie provides for a more sharp and accurate image at cost 
of computational efficiency and ringing artifacts, interpolate=nearest 
selection is most efficient computationally and does not alter image 
palette. However it may introduce some aliasing artifacts. This parameter 
is also ignored for multiples of 90deg, 

f) Resize 

[453] This speed-optimized function serves to change the image size. It can 
be used to fit an image into a small phone screen* or to reduce an image 
size prior to compression and storage, For example* an incoming 3 mega- 
pixel image from a high-quality digital camera may be resized to VGA 
(640 by 480) size prior to JPEG compression and storage* in order to 
reduce storage space requirements, 

[454] The mode parameter selects the interpolation algorithm. Beside the 
usual bilinear* bicubic and nearest methods* proprietary methods are 
supported to provide for optimal performance with various image types 
and target devices, 

e*— Flip 

[455] Flip the image horizontally (verticaNfalse) or vertically (vertical=true). 
For an upside-down one should call this method twice* each with a 
different parameter value (or call Rotate 180 degrees), 

faj — Crop 

[456] Crop may be used when the final image size is limited and the more 
significant details are concentrated in a limited region of the image. 
Cropping most of the background allows applying a more moderate resize. 
After the application of crop method a rectangular area 'cut' from the 
original image is returned, Crop*s interface is the following 
[457] top -the upper bound of the image, range; 1-ImageHeight 
[458] bottom -the bottom bound of the image; top-ImageHeight 
[459] left -the left bound of the image; 1-ImageWidth 
[460] right -the right bound of the image; left-ImageHeight 



25 



WO 03/001770 



PCT/IB02/04148 



[461] The coordinates start from the top-left corner of the image with 
coordinate (1,1), rather than (0,0) used in some other commercial 
software. The rectangle specified by the four parameters has to have a 
positive area, so left <= right, top<*=bottom. The edges are included in 
the rectangle. 

— 'Illumination correction and color manipulations 

[462] The illumination correction is one of the more difficult problems in 
image processing. There are many ways to correct for improper 
illumination/ detector problems. Basic solutions work only on a small 
range of imaging situations. The methods given below are just the most 
simple and intuitive tools, while the premium package contains more 
complex and elaborate algorithms to deal with the problem. 

[463] The AutoLevel method with empty radius parameter is perhaps the 
most powerful algorithm available in the basic package. 

a^=Darken 

[464] This method darkens the image, and has two modes: 

[465] If intensity is not specified, the function performs a darkening which 

may be described as the dark half of AutoLevel. 
[466] If intensity is set, the image is darkened by the specified amount 

(Intensity : [range; 0-1 (0- do nothing, 1-maximal)]). When intensity is 1, i 
all mid levels become black and only colors white (or brightest level in 
channel X) remains as it was. Darken with intensity X is equivalent to 
Adjust levels with eontrast-Q, brightness «* -X. 

fe^=— Brighten 

[467] This method brightens toe image, and has two modes; 

[468] If intensity is not specified, the function performs a brightening which 
may be described as the brighthalf of AutoLevel. 

[469] If intensity is set, toe image is brightened by the specified amount 
(intensity ; [range: 0-1 (0- do nothing, 1-maximal)]). Brighten with 
intensity X is equivalent to Adjust levels with contrast=0, brightness = X. 

el AdjustLevels 

[470] This method gives the user full control and responsibility of toe output. 

Both parameters are mandatory. 
[471] brightness ; [range; -1 - +1 (-1 * black, +1 ■ white), recommended 

range: -.3 - +.3 ] 

[472] contrast ; [range; -1 - +1; (~l=monotonie image, +l=pure color 
image ), recommended range: -.3 - +.3] 

[473] For positive contrast values, first the brightness is adjusted and then 
the contrast. For negative values, contrast is adjusted first. The effects of 
both these operations is accumulated, so the maximal effect when the 
contrast is positive is the sum of the contrast value and the absolute value 
of brightness. This sum should not exceed 1, or be too close to it (i.e. 
eontrast=brtghtness-0.6 will result in a white image, just like 
brightness- 1). 

d^-— AutoLevel 

[474] This is the primary function for illumination correction included in the 
basic package. The global AutoLevel (empty radius parameter) maximizes 
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image contrast, discarding the outliers of very high and very low intensity. 

This is especially useful for images taken in the haze, rain etc, 
[475] The local AutoLevel is activated, setting some positive radius 

parameter. The recommended radius values are in range [10-30], For 

small radius values the image appears grainy. Unlike the global AutoLevel, 

the local AutoLevel stretches the contrast w/o outlier detection. This effect 

is achieved if used after DeneiseSaP, 
[476] The separateColors-true setting allows illumination/color correction as 

well as luminance correction, 
[477] 

e^= — CoIorBalance 

[478] This function tries to produce truly colorful images, by increasing the 
contrast in hue domain. Thus a white object on a blue background will 
appear yellow. This method is similar to the correct illuminant effect in 
vision. On most natural images the effect is very small. Setting mode=l is 
mandatory* The level can vary in range [0-1] with default 0,5, Both 
parameters are mandatory, 

WhiteBalance 

[479] This function calculates the mean value of the gray pixels in each 
channel and brings it to 0,5, The pixel is reported as gray if its color in all 
3 channels obeys abs(pixeLyalue- 0.5)«toleranee, The tolerance is a 
user-supplied parameter with default value 0,1S, The color correction is a 
gamma correction, with automatically calculated parameters tor each 
channel. 

e^=ColorVariations 

[480] This is a two-stage function. In the first stage the gamma of each color 
channel (e.g. red, green, blue) is changed according to intensity value 
between -1 (remove color) to +1 (saturate color), In the second stage the 
saturation of the color is changed by changing the Euclidian distance 
between each channel value and gray image value, such that -1 stands 
for grayscale and 1 stands for saturated colors, 

[481] For a simple cmos based camera such as the Communicam it is usually 
recommended to put 

[482] red « ,2 

[483] green = ,2 

[484] blue - -,1 

[485] saturation « ,3 

fc^-=HistEqualize 

[486] This method performs Histogram equalization (no parameters). The 
resulting image has a uniform histogram (as much as possible considering 
the input color distribution). This is a common solution illumination 
correction, but it has side effects, such as eliminating the real color 
distribution of the image (e,g, adaptive thresholding of the result of 
histogram equalization, is likely to have poor results), 

ft=-PremHistEq 

[487] Unlike the HistEqualize, PremHistEq trades off the speed and simplicity 
for the flexibility of operation. It has a large set of parameters and modes 
of operation which have different effects. 
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[488] P-law histogram equalization allows a trade-off between simple 
histogram equalization (pval=0), no effect at all (pval=l), dominant 
modes emphasize (pvals>l) and dominant modes destruction (pval«l). 
The recommended values are 

t489] Pval Scenario 

[490] 0.3 Histogram equalization of moderate intensity 
[491] 0.7 Histogram equalization of small intensity 
[492] 1.5 Posterizatlon 

[493] -0,2 Histogram equalization of very large intensity 

[494] Local histogram equalization allows histogram equalization on blocks of 

limited size. The recommended size of the block is in the range 0,2 - 0,6 

of the image size. 

[495] Number of histogram bins has quantization effect. For pval<0 it is 
recommended to use at least 32-64 bins. The algorithm uses linear 
interpolation between bins. 

44— LocIllumCorrect 

[496] This method performs local illumination correction and has a large 
amount of sub-methods chosen by correctionType. Other methods which 
locally correct tine illumination level are AutoLevel (local) and, for binary 
output, the local mode of Threshold. This procedure is effective for non- 
uniformly lighted handwritten and printed text as preprocessing to 
advanced applications, such as OCR and feature detection, but it may 
sometimes degrade the visual quality of the Image as perceived by 
humans. Some safety mechanisms were introduced to limit the visual 
degradation of the image. One of this mechanism is setting 
separateColors=false to preserve the original hue of the image. 

[497] The LoclllumCorrect can produce unexpected results with images with 
very limited color palette (16 colors and less). 

ia— GloblllumCorrect 

[498] The global/block-wise illumination correction allows automatic 
correction of image curves with the following sub-methods chosen by 
correctionType; 

[499] gamma - gamma correction, so that the mean value of the image is ,5 
(Stay)* 

[500] curve - a variation of gamma correction with highlights and shadows 
subjected to separate gamma values. 

[501] contrast - synonym for AutoLevel. The addition functionality is bloc- 
wise processing. 

[502] histiEq - synonym for PremHistEq with different interface (number of 

bins and power is selected automatically). 
[503] The correction is global, unless blkHeight and blkWidth are both set. 

The recommended block size is 64x64 or 128x128. The blocks overlay, so 

that their borders are virtually invisible for block size larger than 32x32. 

The separateColors parameter allows to select color channel treatment. 

Usually separateColors-true provides for desired color correction. 
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[504] The GloblllumCorrect can produce unexpected results with non- 
photographic images (e,g, lineart) and images with very iimited color 
palette (16 colors and less), 

1^— Enhance 

[505] This function performs a selected combination of methods based on 

enhanceType parameter, 
[506] If the enhanceType is empty the function performs mild color balance 

+ contrast enhancement. It can be used safely on various input images 

and should improve many of them. The effect is similar to ColorBalance 

followed by Autelevel (global), 
[507] If enhaneeType=eommunieam, batch processing is executed using; 
[508] o ArtifactRemove «> Deblur ~> PremHlstEq -> Crop «> 

ColorVariations GloblllumCorreet, This processing is essentially 

acceptable for various JPEG input images, but most suitable for 

communeam output images, 

3-. — Noise reduction 

[509] The two most common types of noise treated by the basic package 
are: 

[510] 1, White Gaussian noise / speckle noise 
[511] 2, Salt and Pepper (S&P) noise 

[512] White Gaussian noise appears as an intrinsic part of the cheap camera 
detectors, especially in low illumination conditions - it is inaccuracy in the 
pixel values - for many pixels. This is the most common type of noise, 
which appears on most of the images, 

[513] The S&P noise, is a small number of pixels having big "errors* in their 
intensity levels. It appears as a result of interlacing/aliasing in the 
detector, faulty detector, sharpening of degraded images, communication 
problems, poor JPEG compression, scanning of analog photos. This type of 
noise is more rare and easier to treat than the Gaussian noise, 

[514] Since the noise appears independently in each color channel, the noise 
reduction procedures are independently applied to the color channels, 

[515] The noise types defined in this procedure do not have clear meaning in 
line-art images, therefore a natural image source is assumed. For indexed 
images a rich color palette is required, 

a}— Smoothing 

[516] The most common and simple way to deal with noisy, over-sharpened 
images and compression artifacts is smoothing. Smoothing is performed 
by a simple procedure of convolution between the original image and a 
Gaussian kernel, 

[517] The output of this method is a smooth image, where the degree of 
smoothness increases with the optional intensity and radius parameters, 

[518] Since there is no common scenario for the use of smoothing method, 
there are no dear recommendations about the smoothing parameters. For 
medium smoothing which does not degrade image significantly we can 
recommend intensity =0*5, radius=3, 

fe^— DenoiseSpeckle 

[519] This is essentially an empiric Wiener filter. It essentially uses algorithm 
developed by Lee (1980) with minor modifications. The main modifications 
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include a bank of smoothing kermis, the overfilter parameter and 
regularteation process. 
[520] This de-noising procedure is essentially a local adaptive smoothing 
procedure, where the overfilter parameter plays the role of smoothing 
intensity, and the smoothing kernel selection plays the role of radius in 
the smoothing method, 

e4=DenoiseSaP 

[521] This is a standard 2-stage procedure of outlier detection followed by 
their replacement with local median. Changing the threshold influences 
the detection rate. Increasing the threshold will allow under-smoothing, 
while decreasing the threshold will allow over-smoothing. It is 
recommended to work with kernels of 9-25 pixels. 

Sharpening 

[522] On most amateur photos the image is not sharp enough. The common 
reasons are; bad focus, motion blur, JPEG compression. To increase the 
sharpness of the image one can use a sharpening procedure, 

a) Sharpen 

[523] This function implements a standard unsharp masking procedure 
(which is equivalent to smoothing with negative intensity) if edges=false. 
In this setting the function increases the noise in the image, which is 
usually not recommended with originally noisy images. Setting 
edges^true will result in sharpening only over the edges, which is a 
preferable mode of operation. In this mode the radius parameter is 
ignored. The level of sharpening can be controlled by the intensity 
parameter, 

a^= ArtifactRemove 

[524] This function deals with heavy JPEG artifacts only. The artifacts are 
listed below: 

[525] 1. Blocking: various sharp borders between 8x8 tiles 

[526] 2, Ringing: high frequency moir* noise in 8x8 tiles 

[527] 3, Color spill: color channels not coinciding with luminance channel 

[528] 4, Blur; high frequency edge blur 

[529] The ArtifactRemove method with deJPGtype-detail settings does not 
degrade the image visual quality, This is a general-purpose tool, which 
can be used for any JPEG input. However, for any specific problem it is 
recommended to use the dedicated method instead, 

[530] 

Deblur 

[531] Most of tine digital photos taken by low-quality cameras appear to be 
out of focus or blurred. Simple sharpening does not solve this problem 
completely, especially in the noisy environment. Some off-the-shelf 
products (e,g, Extensls Intellehance) suggest blind deconvolution with 
adaptively selected kernel. The kernel is selected, so that maximal 
sharpening is achieved with minimal ringing artifact. While this method is 
implemented deblurType-defocus, it is not recommended in most cases. 
In the presence of noise and JPEG artifacts, the proprietary 
deblurType=premiumSharpen method proves to be more consistent and 
does not degrade image visual quality. The main difference between 
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Sharpen and Debtor (premiumSharpen) is the use multiple kernels and 
homogenization as a part of sharpening procedure. For more sharpness 
one ean use deblurType^premiumSharpenMore. 

4&?-Functional description of AudioTranscode 

[532] The support of multiple input and output format* optimized for the 
specific device, is as important in sound processing as It is in image 
processing* The main sources of audio are MP3 and WAV files, available 
on most PCs, disks and internet sites. These formats allow very rich sound 
at the expense of the disk space. The transmission bandwidth and 
memory of the hand-held devices is very limited. The audio capabilities of 
these devices, don't always make use of the richness of the input formats 
(e.g. stereo) More efficient compression techniques (e.g. GSM AMR) have 
been defined in the context of MMS for playing/recording audio on these 
devices. The audio transcoding process performs the following tasks; 

[533] lv Down-samples and compresses rich audio files (MP3,WAV 
GSMAMR ) 

[534] 2, Masks compression artifacts and allows to play compressed files on 

PC (GSMAMR=>WAV, effect = mask) 
[535] 3, Converts GSMAMR files from abundant to minimal compression 

rates and vice-versa (GSMAMR« >GSMAMR) 
[536] 4. Supports Unix audio standards (AU to anything and anything to AU) 
[537] 5, Adds effects to audio files (via the effect string) for audio quality 

improvement without issuing a separate request, 
[538] 6, Automatically selects the conversion method via the mime type, 
[539] 7. Performs ringtone conversion between Nokia Ringtones (part of the 

NMS standard) and iMelodies, TDD polyphonic tones (part of EMS 

standard and extensions to it respectively), 

IV. G. Other specific enhancements 

■h — Using the MMS Box as a WAP Site 

[540] WAP terminals have a built-in WAP browser. It is possible to go 
to a Web site with the terminal, and cali down relevant information. The 
server will process the information called to optimize it for display on the 
terminal, and the processed information will then be transmitted to the 
terminal for display. This information may or may not be processed 
further by the terminal or by the server, according the user's request. 
Information which has been processed (either once or twice) may then be 
stored, in the terminal, or at the server, or at another information storage 
place specified by the user. Transmission to and from the terminal may 
be by wireless or wireline communication, 

— Converting a WBMP Into a Picture Message 

[541] WBMP is the WAP protocol for graphics. Images on the terminal 

may be displayed In PM format, not WBMP. The server may receive a 
WBMP image, convert it into the PM format, and transmit the message for 
display on the terminal. This conversion is new because the protocols 
WBMP and PM are both new, and therefore the conversion has not been 
performed previously. 

[542] Converting a WMBP Image into an EMS Picture Message 
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[543] This innovation Is exactly the same as converting a WBMP into a 
picture message described above, except that the target format for 
display is EMS rather than PM. It will be appreciated that images may be 
converted into any number of picture formats, be it PM* EMS, or a 
different picture format to be devised in the future* such as Smart 
Messaging, The algorithms in the server make this transcoding possible. 
Again, this is new because the protocols, WBMP and EMS, are both new, 
and therefore the conversion has not been performed previously, 

2t— Human Face Recognition and Display 

[544] Refer to Figure 1 attached to and incorporated into this 

applications In the picture, the woman's face may be recognised by 
algorithms defined in prior art. The invention includes innovative 
algorithms resident in the server which allow the server to process the 
relevant part of the picture, in this case the woman's face, for display on 
the terminal. There are three types of algorithms. The first is orientation, 
The face is oriented vertically, which means that the vertical dimension of 
the relevant part of the picture is greater than the horizontal orientation. 
Some terminals have display screens that are wider than they are tall. To 
capture the full image on one screen would reqire a reorientation of the 
woman's face from verticl to horizontal. The server knows the display 
characteristics of the terminal, and will perform this orientation. 

[545] The second kind of algorithm is "resizing*, Terminal displays 
are generally smaller, often much smaller, than the source image. The 
server will know these characteristics, and will accordingly resize the 
picture for display on the terminal, 

[546] The third group of algorithms is those which will reproduce the image 
on the terminal's display, while maintaining the integrity and quality of the 
image as much as possible. The need for these algorithms arises from 
the small display screen, or from the inherently lower resolution of the 
terminal display, or from other reasons. The server will know the 
characteristics of the terminal display, and will apply the correct 
algorithms for maximum effect. Examples of such algorithms include 
enhancement, dithering, and histogram correction. 

[547] The application of any or all of these algorithms to handset displays is innovative. The 

use of prior art face recognition as part of the system and method described herein is also innovative. 
ft^— Face Binarization 

f*Hfta*&es to BMP 

[548] FIG. 1 8 shoes a block diagram explaining the procedure. 

[549] Image is more or less frontal, eyes should be visible, and illumination variations should not be too 
extreme. Constraints are set both by face detection requirements and by binarization requirements. Size 
of face in image should be sufficiently big. 

[550] a. Face detection, eyes detection: 

[551] Preferably, this would be a face detection SDK (e,g, trueFace, Facelt). 

[552] b. Illumination correction (to the extent possible). 

[553] c. Facial features emphasizing: 

[554] Increasing contrast - brightening the skin, darkening features (eyes, lips, eyebrows, etc.) and hair. 

[555] Sharpening. 
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[556] d. Optimal resizing - undo the blur. 
[557] e. Binarization with dithering. 

^=Faoes to Pk&ure message 

[558] Output: The eyes strip. 

[559] Additional components: eye detection. 

[560] This is especially useful when one has to display a part of the human face on screens that dictate a 
wide and short frame size - e.g. many phones have an aspect ratio of 2: 1 or more in the width/height of the 
display. In addition, the PM (picture message) format of Nokia Smart messaging dictates that images are at 
most 72 pixels wide by 28 pixels high. See examples in attached figure 26 showing the extraction of the eye 
region from an image and then converting it to a picture message. 

4?— Combination of Histogram Correction and Dithering 

[561] A histogram in the current context Is the process by which the 

various pixel values In a grey level Image are distributed on a frequency 
chart, from pure white through various shades of grey to pure black, 
Histogram correction is the process by which some of these values, but 
not all, are lightened or darkened, but even those values affected are 
changed to different degrees, Dithering is the translation of grey level 
images to black & white by the correct combination of the black and white 
pixels to simulate grey in the eye of the user. The use of dithering fbr 
small screens, such as those on the terminal displays discussed here, is 
entirely new. Histogram correction, even for small screens, is not new. 
However, the use of histogram correction in the method and system 
described herein is new. Further, there is a technique by which histogram 
correction is applied first, and then dithering, to transcode a very high 
quality image onto a terminal display that protrays only black & white 
images, This technique is entirely new, and is part of this invention, 

^—Combination of Floyd Steinberg Dithering and Random Permutation 

[562] Floyd Steinberg dithering is a well-known dithering algorithm in 

which error diffusion methods are used to create visually appealing 
dithering with relatively few fixed repeating patterns, Random 
permutation is a technique by which a few random black pixels are 
changed to white and a few random white pixels are changed to black, 
Random permutation is used to avoid "periodicity*, which is a situation in 
which there are appear to be very dramatic changes in shading from one 
part of a picture to an adjacent part of the picture, This problem is 
particularly prevalent when large pictures are compressed into a smaller 
area such as a small display terminal. Random permutation softens the 
effect of these changes, Floyd Steinberg dithering is part of prior art, as 
is random permutation. However, the combination of first Floyd Steinberg 
dithering and then random permutation is new, and is part of this 
invention. Further, the addition of this combination followed by 
transcoding to WBMP, EMS, or PM, as the case may be the particular 
target terminal, is entirely new, and is part of this invention, 

& — Correction for Non-Square Pixels in the Target Display 

[563] Not all terminals have square pixel displays; sometimes the 

pixels are rectangular. If that is the case, then the server may need to 
convert say a 100 x 100 square pixel picture into say 80 x 100 rectangular 
pixel display on the target terminal. The server will know the chacteristics 
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of the display terminal, and will perform the required correction; that 
technique is new. Further, the addition of this technique with other 
techniques described here, and the transmission of the processed 
information, is entirely new, and is part of the method and system 
described herein. 

^^Transcoding of Color and Shade in Images 

[564] Images generally in one of three types of color or shading, 
which are are color (green, blue, and red, or other permutations), grey 
level, and black & white. Terminal displays today are generally black & 
white only, although grey level is being introduced, color displays are 
planned for the future. The server will know the characteristics of the 
display terminal, and will be able to transcode source images into a 
display format suitable for that terminal. Note that it is possible to 
transcode down, or to transcode up from white & black to grey level, but 
it is not possible to transcode into color. That is, the various possible 
permutations are color to grey level, color to black & white, grey level to 
black & white, or black & white to grey level. However, grey level or black 
& white to color is not possible. Also, needless to say, it is possible to 
transcode at the same level, such as black & white to black & white, 
although different algorithms will be employed to to maximize integrity 
and quality of the image on the terminal display. 

& — Panorama Imaging on a Terminal 

[565] Refer to Figure 2, attached to and incorporated into this 

application. Panorama imaging requires that multiple pictures of a scene 
be taken, and then various images be "stitched* together in order to 
create one continuous scene. [Note; Panorama imaging, and the various 
algorithms employed to make it possible, is the subject of a separate 
patent application by the current assignee, UCnGo, Inc.] By the nature of 
the limited size of the terminal display, the entire picture cannot be 
displayed on one screen. Scrolling is required. In addition, re-orientation 
of toe image may also be required, as demonstrated in Figure 2. It 
should be noted that long text messages, which may not be displayed on 
one screen, may also be formatted for scrolling, and again toe scrolling 
may be either vertical or horizontal, depending on toe type of terminal 
display and the nature of toe text. 

9^=-Embedding a WAP Link in an SMS Message 

[566] This in itself is not new, since it is part of the prior art. However, 
it is new as part of the method and system described herein, particularly 
where the SMS link message serves as a method to deliver multimedia 
content to the user of a WAP phone. 

4^-OTA Bookmarks and Enrollment 

[567] "OTA" means "Over the Air", and is a short expression for real 

time action, in this case via a radio system. To do personalized 
bookmarks and home pages OTA is part of the prior art. Each user is 
assigned a specific URL ("Uniform Resource Locator"), with a common 
beginning string and some additional user-specific string (e.g., 
http i//wap.ucngo.com/mmsservice/userboxes/userid=45FCgeo, where 
the bold part is user-specific part). In this way, different users receive 
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different bookmarks (URLs), and when a user operates a specific WAP 
browser, the UCnGo server receives an HTTP request with a URL unique to 
the given user, and the server wiii know what information to display based 
on the specific URL. To give each user his or her own URL, is aiso part of 
the prior art, but oniy in a wireline environment. To give a URL to a 
wireless subscriber is new. To combine ULR with OTA for personalized 
bookmarks, personalized home pages, URL links, enrollment in various 
services, and other services, is new, and is part of this invention. This 
combination requires both the application of database technology and the 
algorithms defined in this application, 

44^=Correctine for Inversion of Pixels on Target Display 

[568] In some terminals, the display has been intentionally inverted. 

That is, if a 0 bit is usually white, and a 1 bit usually black, in an inverted 
terminal toe 0 bit appears as black and the 1 bit appears as white. 
Therefore, if ahy MMS message at all is sent, the display in an inverted 
terminal will show the message as a negative of the original. ATimeport 
phone, by Motorola, Inc., is an example of such an inverted display. This 
inversion is not a fundamental problem, as long as the server knows the 
terminal characteristics, and can correct for toe inversion. The server in 
this application does know the terminal characteristics, and will correct for 
the inversion before toe message is transmitted. 

43v=-Transcoding of Text or Numerics into a Picture 

[569] The Invention will transeode text or numbers into a picture, in 
WBMP, PM, or EMS format, as required by the target display. This is new. 

4^=Use of Modules Such as OCR and ICR to Identify and Process Text and Images 

[570] The server uses OCR and ICR (Intelligent Character Recognition) 
to Identify which parts of an image are text. Reference is now made to 
Figure 3, incorporated into this application. The first step in processing an 
image is toe processing of the image into parcels such as text and 
drawing. Different processing techniques are then applied to each type of 
parcel. Rules will be applied, such as "Ignore grey level information* 
because toe image may be in black & white, or "Maintain line solidity". 
Without the parsing and application of techniques, the image will be 
reproduced on toe terminal in a manner similar to what is written as 
"NaT. ve Transcoding" on Figure 3, With the invention, the "Optimal 
Adaptation" level is achieved. This process is part of toe invention. 

44f— Flexibility Resizing 

[571] As a specific case, "flexible resizing" is a technique by which 
different parcels are resized differently; for example, text may be resized 
as little as possible to maintain legibility, whereas an image, such as that 
portrayed in Figure 3, may have a greater degree of reduction, since only 
recognizability, not legibility, is required. Flexible resizing is also part of 
the invention. 

4Sr— Adaptive Classification Engine for Smart Resizing 

[572] A variation of flexible resizing is where the decision of flexible 

resizing is generated not solely by recognition algorithms, but rather by 
recognition algorithms in combination with parsed samples. The first step 
of the procedure is that various sample images are fed into the server's 



35 



WO 03/001770 



PCT/IB02/04148 



database. These images have already been parsed, and individual parcels 
have been identified as requiring different processing algorithms, in 
various orders of operation. The parsing, algorithms, and order of 
operation for the algorithms, have been tested by both theory and trial & 
error, and have found to produce optimal results. When a new image is 
then received by the server, the image can be parsed, the parsed parcels 
will then be compared to the database of parsed parcels, and the 
classification engine will then choose, on tine basis of the samples and the 
target image, which algorithms and which combination of algorithms to 
apply to each parcel. This classification is "adaptive* in that in changes 
either with the addition of samples, and/or with feedback from the results 
of processing on real images. The adaptive classification engine is like a 
learning machine that applies rules and improves its own performance 
over time. The concept of a learning machine by itself is prior art. An 
adaptive classification engine for smart resizing of MMS messages is 
entirely new, and is part of this invention, 

44y-Vectorizing and Processing Charts and Cartoons 

[573] The processing of charts and cartoons is similar to the 
description of Use of Modules Such as OCR and ICR to Identify and 
Process Text and Images (see above). It is portrayed in Figure 4, hereby 
incorporated into the application. That is, the image will be parsed to 
determine the various parcels to be processed. Then rules will be applied. 
For display of a standard size chart on the small display of a terminal, for 
example, superfluous information such as a series of values on the axes 
will be removed. Also, a rule such as "remove horizontals* may be 
applied, since the addition of the horizontal to the small display screen 
will make the graph almost unreadable. For the graph itself, a rule such 
as "maintain line solidity* may be applied. The entire image will then be 
resized to the small display. Vectorizing algorithms are prior art. For 
example, Adobe Streamline Software uses this technique. The technique 
has not been applied to small screens such as the terminals discussed 
herein, and that is new. The combination of that technique with resizing 
and the other operations described herein are part of the method and 
system of this invention, 

[574] 

[575] This is a further explanation of the vectorizing and processing of 
charts; 

rt— Charts to WBMP 

[576] Standard charts - graphs of a continuous one-dimensional function, 
e,g, stock charts and other temporal variables, (A better solution can be 
provided more easily for a known source with known format), 

[577] a. Identification and separation of graphics to content layers; graph, 
grid and boundaries, graph label, range text (numbers on both scales), 
background, b. gnore clutter (background, grid, possibly grid 
boundaries). 
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[578] c. Handle range; Ignore most values; maintain min, max at required 
size. 

[579] If possible; safely resize. Use Oar if needed/possible, 
[580] d. Handle graph label; 
[581] Long text; ignore, or ocr and return as text. 
[582] Short text (e.g. stock symbol/name); preserve label, but move it as 
needed. 

[583] Problem; range, label may not always be property identified/handled 

for unknown format. 
[584] a. Resize graph; 

[585] Determine the maximal allowed size of the graph (after labels 

w waste*); With maximum scroll; with graph fully in screen; with graph and 
label fully in screen. 

[586] Determine which size to use. 

[587] Resize graph maintaining line continuity, and without thickening lines 

without necessity (consider graph as b/w). 
[588] Aspect ratio does not necessarily need to be maintained. 

^Hltoek diagram: 
[589] FIG. 18 shows a block diagram for the procedure 

-Charts to Picture message 

fl>=C©»^&e»jts - changes JOpom wbmp; 
[590] a. Grid boundaries ignored (or just marks); b. Handling range; Ignore 
horizontal range; c. Range and labels; Use Ocr if possible (return; graph 
+ *MSFT, range 80-115*); d. Resizing graph is the same (with different 
parameters). 

4?r-=Coiiversion Algorithms s 
a^=Cartoon 

^=Catt<K)ft to WBMP 

[591] Content for the system is b/w line art. The size and amount of text 
should allow applying moderate resizing and fit, or recognizing through 
OCR. Typical samples are available at *K;\QA\Image_Banks\Comics\sliced 
comlesXb&w*. 

[592] a. Correct conversion of generic content within the one-block 
envelope. 

[593] b. Generic block identification and splitting 
[594] c. Handling text; 
[595] i. Identifying text. 

[596] ii. Identifying whether standard conversion would be ok. 
[597] Hi. More moderate resizing + word reordering 
[598] Problems; 

[599] This may not always fit. Do we split it to multiple images? 
[600] Identifying optimal resizing. 

[601] iv. Specialized resizing - to allow legibility after resizing. 
[602] v. Recognizing with Ocr; 

[603] Evaluate and tune use of both scansoft and ParaScript (cartoon text is 
closer to handprint) 
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[604] Appropriate preprocessing, 
[60S] Problems: 

[606] Cartoon text Is small and not neatly printed - Ocr may fail, 

[607] Handling non-English text, 

[608] vi. Support getting text as input, 

[609] d. Recognizing and cleaning non-relevant text 

[610] FIG, 19 shows a Block diagram without OCR. 
[611] FIC20 shows a Block diagram with OCR; 

<3^= QmmtL\X!> Ptettwe SMS 
[612] AH limitations and stages for wbmp apply, 
[613] Further envelope constraints; 
[614] Text must be Oer-able or given as input, 
[615] Text limited to 60 characters, 
[616] Image part content must fit in 28 pixel (height). 

An Example of a Combination of Techniques: Dithering, and then Adapting 
Photographs (with different image material") to WMBP. EMS, and/or PM. format: 
— "Generic" photo binarization 
WBMP 

[617] Typical scenes to be handled; car(s), profile of person, full body, 

multiple faces/people, skyline, signs, trees, etc, 
[618] Desired output is the silhouette of the objects) in the image, an 

identifiable binarization of the scene, 
[619] FKS,20 shows a Block diagram for this procedure 

[620] a. Emphasizing boundaries, decreasing intra-surface changes 

(illumination, etc.), 
[621] b. Resizing 
[622] c. Identifying silhouettes, 
[623] d. Brightening background 

[624] e. Appropriate dithering within objects - when needed (only big 
surfaces), 

[625] f. Combining the silhouettes with the surfaces, 
[626] A data set for tuning and evaluation should be collected from current 
image banks + web, 

^= Pfeofcss to Ptetur© message; 
[627] Problem; most scenes ean"t be converted to 28 pixels. It is usually 

difficult to identify important parts of the scene 
[628] Additional components; 
[629] "Central object* detection, 
[630] When that is not possible - additional resizing, 

IV.IL VAS extended functionality 

[631] It is expected that a large portion of MMS traffic will be generated by 
VAS application such as; photo albums, game sites, news sites etc. The 
UCnGO MCS provides special functionality for such services; 

[632] 1, Watermark embedding/detection for digital rights management 
(DRM) - in order to track/block/limit the distribution of copyrighted 
content, a mechanism is provided to embed a watermark in any media 
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object. This watermark does not change the Image/sound of the media 
object in a perceptible way. It should be noted that this method does not 
interfere with other DPM mechanisms such as those provided already by 
Nokia (by special tags in WAP pages) and Ericsson (tags in content 
descriptor)* It provides an additional mechanism which is independent of 
external tags. 

[633] 2, Content Based Transcoding - the conversion of visual information 
(images, animations and video) to displays with limited color range, 
memory and resolution is enhanced by knowing the type of the visual 
content. For example, the optimal conversion algorithm of a chart into a 
B/W display is different from the optimal algorithm for a natural 
photograph. See figure 4 for examples, 

[634] 3. Layered graphics input - the vast majority of professional graphic 
and photographic content is prepared using Adobe software tools such as 
PhotoShop, It is stored in a special format (PSD) which maintains the 
different layers of the image. Different layers can include the background, 
text, images added from other sources etc. It is not unusual for a high 
end image for the publishing or web-publishing industries to include over 
50 separate layers. The MCS supports the PSD format as an input format, 
thereby preserving the original quality and enabling the content based 
detection module to handle each layer separately , 

JKJ. M essage interface 

£635] The MPS supports two distinct interfaces to the MMSC/extemal 
applications: 

[636] 1, An XML-RPC/HTTP interface, enabling platform and operating 
system decoupling between the MMSC and the MPS, This interface, 
documented in the attached ICD, enables complete control over the 
manipulation and conversion operations of the media objects and works at 
the media level, 

[637] 2. A 3GPP standard, message-based interface designed in to make the 
integration of the UCnGO MPS as standard as possible for the OEM MMSC 
integrator or the VAS provider. The interface is based on the 
MM7/MM4/MM1 protocols. Using this interface, a complete unchanged 
MMS/Email message as received from the WAP GW/the other operators 
MMSC/ the VAS can be sent as is to the MCS, and the response from the 
MCS can be sent as is to the recipient phone/MMSC/VAS server, 

[638] Specifications: 

[639] The SMTP protocol (default port 25, configurable) or the HTTP POST 
protocol (configurable port) is used to transfer the message for 
processing. The message can be any standard MM1/MM4/MM7 message is 
defined in the 3GPP TS23.140 Release 5,20, document. The target 
device(s) type identification can be performed in the following manners: 

[640] » The message header contains a set of (one or more) *X-MMS- 
UserAgent* or "X-MMS-UAprof* or *X-MMS-Moder descriptors - in this 
case these descriptors are taken as representing the model types for the 
different intended recipients as they appear in the message in the TO; 
data field. 
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[641] The message header contains no extra information about the target 
devices* but an LDAP based user/device database has been configured to 
supply device parameters based on a user's MSISDN or email address. In 
this case the MCS performs an LDAP query for each target recipient 
specified in the ""TO:** field of the message in order to find out the 
recipient terminal type, 

[642] 

[643] The processed response in returned in one of the following manners ; 

[644] The MCS can be configured to send the processed messages to a 
target SMTP server as MM7/MM4 messages. This way the MCS can sit 
between the external VAS/extemal MMSC and the local MMSC with no 
configuration changes. 

[645] The MCS can be configured to send the processed messages via HTTP 
POST to a target server as an MM1 message. This way the MCS can sit 
between the WAP GW and the MMSC MMS proxy. 

[646] The processed response(s) will be sent in MIME multipart format, with 
the presentation layer and media objects converted based on the 
recipient device. For example, for a WAP phone the presentation layer will 
be in the text-wml MIME type, and images will be in GIF/WBMP format. 
The message will be submitted once per each target device, since the 
content for the different target devices is now different, having undergone 
conversion. So for example an incoming MM7 message targeted at four 
recipients will generate four MM? messages with one recipient each, 

W.J. P resentation level conversion 

[647] Effective multimedia presentation requires some information on the 
spatial and temporal relations between the different media objects 
presented. This functionality is performed by the presentation layer - 
HTML in web pages and Email, WML in WAP pages, SMIL in MMS 
messages. Some multimedia formats (e,g, EMS) and phones (e.g. Nokia 
3510, Nokia 7210) do not support an explicit presentation language, but 
rather display the different media objects according to their own pre- 
defined logic, 

[648] This means that in addition to the media objects conversion, the 

presentation level description has to be converted. In more complex cases 
(e,g, when no presentation description language exists) the actual media 
conversion has to be changed to account for the presentation logic. Some 
examples are: 

[649] An image and accompanying text is to be sent to a WAP phone. By 
changing the image size target one can guarantee that the text will be 
able to be viewed on the screen together with the Image without scrolling. 
This requires knowledge of the phone's effective (versus physical) display 
size, and control of the image size in pixels, the WML description (e,g, the 
align«Meft* directive for the text), etc. Thus, the generated WML deck 
should contain the proper parameters, 

[650] An MMS message containing a cartoon with a set of images and 
associated text, alongside with SMIL description is sent to a Nokia 3510 
which does not support SMIL, There is no specific order for the media 
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objects - that is, the last image (the cartoon's "punchline*) can be the 
first image in the MMS message. Hence, if the SMIL description is just 
omitted, the received content will be worthless. The presentation level 
logic in this case has to order the media objects according to their desired 
display order as specified in the SMIL description, so that the Nokia user 
will get the cartoon in the proper sequence, 
[651] Specifications 

[652] The UCnGO MCS provides for the presentation level conversion for the 
SMIL,MIME multipart, HTML,WML, EMS formats (see Rg, 37 for the 
supported conversion matrix). Of these, SMIL and MIME multipart are 
supported as input formats, and all are supported as output formats, 

[653] 

[654] The supported conversion operations include: 

[655] 1, Relative positioning of media objects based on target screen size, 

[656] 2, Formatting of text (Bold, Italic, underlined, size) - in EMS, 

WML,HTML, MIME/HTML, 
[657] 3, Picture and animation smart compress - that is, the images and 

animations are adapted to fit in pixel size and total file size based on tee 

directives of 

[658] 4, Audio smart compress for WAV/AM WAC, truncation for ringtenes, 
[659] 5, "video frame rate reduction, resizing, 
[660] 6, Orientation change for images/video based on screen size, 
[661] 7, Different scaling based on parameters - use viewable area, use 

scrollable area, target application type (WAP/MMS etc), 
[662] Supported SMIL tags include: root-layout, region (and relevant tags), 

dur, 

W.K> 1. H igh Level Code for transcoding to a Picture Message 

[663] %This is an algorithm in pseudocode for converting any input image 

into a B/W image with a maximum size of 72 by 28 pixels - 
[664] %The maximum size for a Nokia Smart Messaging Picture Message 
[665] function I4=newdither<fn,dorot), 

[666] ¥e if dorot is specified tee image is rotated by 90 degrees to make it fit 

tee screen aspect ratio better, 
[667] I*imread(fn);I=double(I)/256; 
[668] if exist(*dorot'),I=imrotate(I,90, , nearest , )jend; 
[669] if 

isgray(I),It=zeros(size(I,l),size(I,2),3);It(:,:,i)«IiIt(:,:,2)^IjIt(:,:,3)=Ij 

I=It;end; %we make sure tee image is 3 channel 
[670] %R<3B image even if it started out as a grayscale image, 
[671] I0=rgb2hsv(I);It=I0; 

[672] for Nl : 3,I0( : , : ,i)«filter2(ones(6,6)/36,I0( :, : ,i),*same*) ; end; 

[673] It(:,:,3)~It(:,:,3)-0,75*IQ(:,:,3);% We perform an unsharp operation 

in the luminance channel of the image 
[674] mx~max(max(It(:,;,3))); 
[675] mn-min(min(It(:,:,3))); 

[676] It(:,:,3)=histeg(It(:,:,3));%We stretch the histogram of the luminance 
channel 
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[677] l=hsv2rgb(lt);% And then convert back to RGB 

[678] l«imresize(I,73/(size(I,2))*[0,8*size(I,l) stee(I,2)]/bicbuic»); 

[679] I=I(:,2:end*:); %The image is resized, and eliminate the edge due to 

edge-scaling effects* 
[680] mx-max(I(;)); 
[681] rnn«min(I(:)); 
[682] I=(I-mn)/(mx-mn)| 
[683] I0=rgb2hsv(l);tt-I0; 

[684] for N 1 : 3,10{ : , : ,i)=filter2(ones(6,6)/36 t I0( : , : ,iy same*) ; end ; 
[685] It(:,:,3)«It(:,:,3)-0,S*I0t:,:,3); 
[686] mx==max(maxtlt(:,: t 3))); 
[687] mn=min(min(It(:,;,3))); 

[688] It(: t ;»3)*histeq(It(: t ; t 3));%(It(: t ; t 3)-wn)/(m^mn)j 

[689] I~hsv2rgb(It);%We repeat the same transformation for the resizee 

image with different unsharp parameters, 
[690] cy*floortsize(I,l)/2); 
[691] ll=I(ey-minC13,cy-l):ey+min(14,cy)*: t :); 

[692] [I2,M]*rgb2Snd(IlA > dither , );% We reduce the image to 4 colors with 
dithering 

[693] I2«(rgb2gray(ind2rgb(I2 t M)));% Convert to grayscale 

[694] I2«(I2-min(I2(0))/(max(I2(;))-min(I2(i)))J 

[695] I3^zeros(size(Il,l)»72 t 3);I3(:,:,l)«I2;I3(:,; t 2)«I2;I3(; t : f 3)=I2; 

[696] I4=*dither<13,[0,25 0,25 0,25; 0,75 0,75 0,75;])>0; %finally we dither 

into 2 colors 
[697] return 
[698] 
[699] 

IK K *2* High Level Code for homogenized sharpening 

[700] 

[701] % smart_sharpen_homo — Sharpening algorithm ) 

[702] % 

[703] % Usage: 

[704] % finalJmage~smart_sharpen_homo(initiaUifnage,param) 
[70S] % 

[706] % Inputs: 

[707] % initiaUmage - RGB 

[708] % param - parameters structure 

[709] % param,Radius«[10,2]; %LpHomo Window 

[710] % param,Power=[2 t 16]; ¥«LpHomo Rower 

[711] % param, Weights* [1,1]; %LpHomo Weight 

[712] % param,thrjgain=20; %- threshold-like gain for sharpening 

[713] % param,thr_power*,3; 

[714] % param,thr_window=5; 

[715] % 

[716] % Outputs: 

[717] % finaUmage - RGB 

[718] % 
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[719] % 

[720] % See Also 
[721] % 

[722] % Written by Goldentayer Lev, March 2001 
[723] 

[724] for eh-l:size(initMJmage,3) 
[725] mp$c=initlaU<Tiage(;,:,ch); 

[726] matVaNparam.Radius(l)jmatrixDims * int32([matVal ,matVal ,mafVal 

t matVal ]); 
[727] 

Jl^doubleCiplmexC'ucngoLPHomogenteattonR^VsingleCmpt^^rnatrixDim 

s,single(param.Power(l)) ))} 
[728] matVal«param,Radius(2);matrixDims * int32([matVal ,matVal ,matVa» 

,matVal ]); 
[729] 

32=double(iplmex(*ucngoLPHomogenteationR > %single(mplc),matri>?DIm 
s,single(param.Power(2)) )); 
[730] aO=param,Weights(l);al«param.Weights(2); 
[731] l=(mpte+a0*31+al*J2)/(a0+al+l); 

[732] I«(Mmin(I(;)))/(wax(I(:))-m^(l(0))i 

[733] K«ones(param.thr_window)/pi)ram.thr_window/param 

[734] thr_mult=(max(min(abs(mpiC' 

[735] finaUmage( i ♦ i »ch)=mpic.*(l~thr_mult)+L^hr_mult; 

[736] end 

[737] 

IV.K3. High level code for image color palette and size adaptation 

[739] 

[740] % This is a pseudocode implementation of a smart compression 

algorithm - namely, 
[741] % an algorithm that gets as its input an image file and outputs a 

version of the image in a specific format (GIF in this example) 
[742] % and with a maximum file size ttiat does not exceed the limitations 

dictated by the end device. 
[743] % This implementation iterates the number of colors in the 

quantization step until the file is small enough. 
[744] function en^smartcompress(filename), 

[745] NumCol=[2 8 10 13 16 20 24 32]; %These are the numbers of colors 
in the colormap - can range from 2-256 for a typical 256 color 

[746] %display. These quantized color number steps are dependent on the 
target device - one would choose diffemet steps for other devices 

[747] % in order to minimize the run time, 

[748] SizeThreshold«2720; % This is the upper threshold in bytes on the 

output file size for a T68 using a WAP browser. 
[749] Impath« , Di\ImageMagick\ImageMagick-win2kV; 
[750] netpbmpath~*D:\netpbm\binY; 
[751] NumQuant=length(NumCol); 
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[752] l=imread(filename);l~double(I)/255; %we read the original file and 

convert it to a floating point image, 
[753] Q=rnin(96/size(l,2) t 100/size(l,l)); 
[754] ll=zeros(size(I)); 

[755] for i«li3 t Il(i*^0=fi^2(ones(3 t 3),l(i»:*0/same , )iendj %We create a 

blurred (rectangular kernel) version of the original image, 
[756] IKdouble(l)-0,2S/9*double(Il));I=l-l,*(I<0); %And then substract it 

from the original to create an unsharp operation, 
[757] It=rgb2ycbcr(l)^t«histeq(It);It=imadjust(It,[0 1],[0,1 0,9]); 

%eonversion to luminanee/color space to stretch the luminance histogram 
[758] lt(:,:;l)=histeq(It(i»; t i));It(it:^)«imadjust(histeq(It(ui^))»tO 

1],[0,2 0,8]); 

[759] It(:/, t 3)«imadJust(histeq(It(:»i,3))*[0 1]*[0,2 0,8]); 

[760] l«(ycber2rgb(lt)); % we convert back to RGB colorspace after 

stretching the luminance channel, 
[761] I=histeq(I);I«I,^0,75; %And then we further manipulate the 

histogram, 

[762] I~imresize(l,Q,'bieubic'); %We resize to the target size in pixels (this 

could also be an iteration based on target file size), 
[763] imwrtte(I/inp,jpg7Qualfty\100); 

[764] dos([Impath 'convert -compress L2W * pwd *\inp,jpg * pwd '\inp,gif ]); 
[765] dos([netpbmpath 'giftopnm inp,gif > inp,ppm*]); %We convert the 

resized image to a ppm file so we can run the ppmquant algorithm 
[766] Rle$ize«zeros(l*NumQuant); 
[767] MaxNum<=2; 
[768] for NliNumQuant, 

[769] dos([netpbmpath 'ppmquant * num2str(NumCoi(i)) * inp,ppm > 
out,ppm*]); 

[770] dos([netpbmpath 'ppmtogif out,ppm > out,gif]); 
[771] D«dir(rout,gif]); 

[772] RleSize(i)-D,bytes; %We measure the file size of the output image, 
[773] if (RleSize(i)<SizeThreshold),MaxNum=NumCol(«);end; % If it is 

valid (less than threshold) we update the maximum 
[774] %color number we could achieve 
[775] end; 

[776] disp(MaxNum); 

[777] dos([netpbmpath 'ppmquant * num2str(MaxNum) * inp,ppm > 
out,ppm']); 

[778] dos([netpbmpath 'ppmtogif outppm > shidl,gif ]); 

[779] dos('ftp -s:transfer.bat'); %we put the file in a directory with pre- 

source information (in this case a WML card deck) so that 
[780] %we can view the image on a target device 
[781] end; 
[782] 
[783] 

H igh Level Code for ml&t n&to$m&m. t& $p&£t*tt C&t&t ttfmtts&S fi^> 

[784] 
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[785] function finalJmage=v332_ditheKinitiaLimage) 
[786] % v332_dither — jphone dithering algorithm (for faees,ears etc) 
[787] % This algorithm adapts to displays with a 3-3-2 color palette (R,G,B)> 
This Is 

[788] % prevalent in Japanese color handsets 
[78§5 % Usage: 

[790] % finaUmage«v332_dither(initiaUrnage) 
[791] % 

[792] % Inputs; 

[793] % initiaLimage - RGB 

[794] % 

[79S] % Outputs; 

[796] % finaUmage - 332 image 

[797] % 

[798] % 

[799] % See Also 
[800] % 

[801] % Written by Goldentayer Lev, March 2001 
[802] 

[803] for ch=l:3 
[804] 

[80S] initialjmage^gray « init$aUmage(:,:,ch) J 
[806] 

[807] %mpic=im2double(imresi2e(initiaUmage_gray,[92 98]/bicubSc*))* 
[808] %mpic=im2double(imresize(inttSaUrnagejgray,[74 98]/bicubie))j 
[809] resteejratio=96/stee(initiaUffiage,2); 

mpic«im2double(imresi2e(initiaUmage = .gray,resi2e_ra«o^bicubic , ))j 

[810] 

[811] mpie~unpair(mpic,[l 1 1 1]); 
[812] mpic=(mpic+histeg(mpic))/2j 
[813] 

[814] matVaN2;matrh?Dims = int32([matVal ,matVal ,matVal ,matVal ]); 
[815] 

J4«double(iplmexCucngoLPHomogeni2ationFP' t single(mpic),matrixDim 
s,single(16) )); 
[816] 

[817] a«.3;mpic«(mpic+a*M)/(l+a); 

[818] rgbjmage(:,;,ch)-mpic; 

[819] end 

[820] 

[821] 

[822] jj-Q; 
[823] for 11*1:8 
[824] fori2=l;8 
[825] for i3«l;4 
[826] Jj«jj+1; 

[827] YMAP(jj,l)=01-l)*(l/8)+(l/16); 
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[828] YMAP(jj,2)Kt2-l)*(l/8)+Cl/16); 

[829] YMAP(jl*3)^(t3>l)*((l-l/8)/3)+(l/16); 

[830] end 

[831] end 

[832] end 

[833] 

[834] 

IV.K.5 High Level Code for JPEG artifacts removal 

[835] 

[836] function finaUmage=jpeg J«ter(in!tlaUmage) 

[83?] % jpegjRfcer — Jpeg artifacts filter 

[838] % 

[839] % Usage; 

[840] % flnaUwage=jpegJilter(lnitialJmage) 

[841] % 

[842] % Inputs; 

[843] % InltiaUmage - RGB 

[844] % 

[845] % Outputs; 

[846] % finaUmage - RGB 

[847] % 

[848] % 

[849] % 

[850] % See Also 
[851] % 

[852] % Written by Goldentayer Lev, March 2001 
[853] 

[8S4] xpic=im2double(inttiaUmage) ; 

[855] for eh=l;stee(*pfc,3) 

[856] xfHt(;,;,dh)-fitter2(ones(2)/4^picC; t ;*dh))i 

[857] end 

[858] fbreh«l;stee(>qpte,3) 
[859] 30=xmt(;*:*eh); 

[860] matVaN6;matrtxDims ■ lnt32([matVa« .rnatVal t matVal ,matVal ]); 
[861] 

Jl=double(iplrnex('ucngoLPHornogenteationFP\single(xfitt( ; , ; ,eh)) t matrtxD 
ims,slngle(8) )); 

[862] matVai-2;rnatrtxDiirns = lnt32([matVal »matVal ,matVal ,matVa* ])j 
[863] 

J2«double(iplmex( , ucngoLPHomogentea«onFP%singIe(xfilt( ; » ; ,eh)),matr 
fxD!ms,single{16) )); 
[864] a0=2;al=»5;a2***l; 

[865] xenh(;,;,eh)^(aO*JO+al*31+a2*32)/(aO+al^a2)j 
[866] end 

[867] for ch=l;stee(xpic,3) 

[868] finaUmage(; t ;»ch)=wiener2(xenh(; t ;,ch) t [3 3]); 
[869] end 
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870] 

871] finalJmage~(max(min(finaLJmage,l),0) *(xp!c-K2S)+xpie *(1- 

xp$c))/l,2S; 
872] 

IKK. 6. High Level Code for image content detection 

874] 

875] function eontent=sheH(varargin) 
f876] if{nargin==0) 
[877] figure 

[878] set(gefVName7lmage Content Detection Shell'); 

[879] set(gcf, , MenuBar% , none , )j 

[880] 

HM_utiNufmenu(gcf/Label% , Execute%*CaSlback%*content„detect„v4(* ,, exe€ 

ute")"); 
[881] else 

[882] [in_Jle, itupath] = uigetfileC* *'); 
[883] pauset\2); 
[884] try 

[885] set(gcf/^interywatefV); 

[886] contentJ^pe=content_detect(in.,fite y injpath); 

[887] set(gef/r^ntery arrow')* 

[888] %if<0) 

[889] catch 

[890] beep; 

[891] dtsp(lasterr); 

[892] % dtep(lnvalid Ale name*); 

[893] setCgcf/Pointer^-arTOw"); 

[894] end 

[895] end 

[896] return 

[897] 

[898] 

[899] function contentjtype«content_detect(lnjRle, tnjpath) 

[900] image = name=[in jpath,SnjRle ]; 

[901] inJmg=contenturead(image_name); 

[902] coLsat=eentent_eolor„$attfnJmg); 

[903] backgroundjtfect=eontentJ^ckgroundtinJmg); 

[904] dlther^measure^content^dithertinjmg); 

[90S] unique_colors=content_colors(lnJmg) ; 

[906] 

[907] if(unique_colors«=2) 

[908] grad^vect^content^grad^statCfilterSConesCS^syss^rtJmgC : f i ,1))) ; 

[910] Jftgrad_vect(2)>gradLvect(l)*l) 
[911] res^-Photographical Image;"; 
[912] else 

[913] res^Synthetie Image; *; 
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[914] itXgrad„veet(l)>grad„vect(3)*l) 
[915] 

[916] lftdither_rneasure>,01) 

[917] res = [res, Texture? ']; 

[918] end 

[919] end 

[920] ifteoLsat>0,04) 

[921] res=[res, 'Saturated Colors; ']; 

[922] end 

[923] end 

[924] 

[92S] 

[926] if(unique„colors<-2) 
[927] res** [res, "Binary; 

[928] e«seiftmean2(max(in_img,3)*mSn(in_img,3))==0) 
[929] iftunlque_colors«:«256) 

[930] res=[res t num2str(unique.„eolors), * Colors; ']; 
[931] else 

[932] res~[res, 'Rich Color; ']; 
[933] end 
[934] else 

[935] res«[res, "Grayscale; ']; 

[936] end 

[937] 

[938] lfthaekground_veet(l)>0,,25) 

[939] res«[res, 'Background Color Detected; ']; 

[940] end 

[941] 

[942] 

[943] dlsp(res); 

[944] 

[945] 

[946] %grad_vect 

[947] stat«['Edge; ',nurn2str{grad„veet(l)y \newline V* 

[948] "Curved; ',num2str(grad_veet(2)) t ' \newline V*> 

[949] 'Smooth; " t num2str(gradLvect(3))/ \newllne V« 

[950] Texture; %num2str(dlther_measure)^ \newline V» 

[951] 'Unique Colors; \num2str(unique_eolors)*' \newline V» 

[952] 'Color Saturation; \num2str(eoLsaty \newline V« 

[953] 'Background Area; %num2str(backgrouncLvect(l)) t ' \newline V* 

[954] 'Background Color; \num2str(background_vect(2;end))]; 

[955] 

[956] 

[957] %figure(l), 
[958] %clf; 
[959] subplbt(l,2,l); 
[960] lmsh©w(injmg); 
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[961] «tle([injRte]); 

[962] subplot(l,2,2); 

[963] setCgcfVUnitsVnorrnalized') 

[964] imshow(ones(128,256)*0.9); 

[965] tRte<r\ie,res]); 

[966] hh=text(2Q,70,stat); 

[967] set(hh,TontStee\8); 

[968] 

[969] %disp( H kaki') 

[970] contentjtype«resj 

[971] return 

[972] 

[973] 

[974] function injmg^contentjread (image^name) 

[975] if(isemptytimagejname)), 

[976] imagejrtame-*dolyparton*jpg*; 

[977] end 

[978] 

[979] aa*tmflnfo(lmwuname); 

[980] if(strcmp(aa,Co^orType/trueco^or , )) 

[981] in Jmg=im2double(lmread(image jr»ame)) ; 

[982] else 

[983] [A*Map]=imread(image„name); 
[984] inJmg=irn2doubte(ind2rgb(A,Map)); 
[985] clear Aj clear Map? 
[986] end 

[987] di$p([*tmage; %irnage_narne])j 

[988] return 

[989] 

[990] 

[991] function coLsat-contont,„color_SQt{Hi„lmg) 
[992] dvjmg~std(in_img4*3)j 
[993] ssi~prod( stee(injmg)); 

[994] %disp([ * color saturation; \num2stKsum(dvJmg(;))/ssi)]); 
[995] coLsak=sum(dv Jmg( ; ))/ssi j 
[996] return 
[997] 

[998] function background _yect«content_background(injmg) 

[999] nchannels«stee(injrng t 3); 

[1000] img_area=stee(inJmga)*stee(irU*ng,2) ; 

[1001] [A,M]«rgb2ind{inJmg,256); 

[1002] [aa + ind]=max(imhist(A t M)); 

[1003] backgrouncLyect(l)«aa/prod(stee(A)); 

[1004] backgroundLvect(2; l+nchannels)~M(ind)j 

[1005] dlsp([ * Background size; \aa/prod(stee(A))]); 

[1006] disp([ * Background color: \num2str(M(ind))]); 

[1007] return 
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[1008] 

[1009] 

[1010] function dither„measure=content„dither(U-iJmg) 

[1011] nchannels=sfee(injmg,3); 

[1012] for(ch ~l:nchannels) 

[1013] tmpjmg=ln,jrng(:,;,eh); 

[1014] dither_score(ch)=mean2(Qbs(tmpJmg- 

medfnt2(trnp_Jmg,[3,3]))^ 

[1015] end 

[1016] dither_measure=mean(dithet_score); 

[1017] dlsp([*Dither score\ num2strtdfther =s score)]); 

[1018] return 
[1019] 

[1020] function unIque_coiorsscontent = colors(inJmg) 

[1021] if(sum(injmg(0-arusmg(i)>0))) 

[1022] tA,M]= cmunique(iruimg); 

[1023] unique_colors~length(M); 

[1024] else 

[102S] ifCsum((inJmg(:)'truimg(l))^2)) 

[1026] unique_colors-2; 

[1027] else 

[1028] unique_eolors«l; 

[1029] end 

[1030] end 

[1031] return 

[1032] 

[1033] 

[1034] function grad_vect"Content_grad_jstat(lruimg) 

[1035] nchannels«stee(injmg,3); 

[1036] 

[1037] 

[1038] nhood«ones(S,5); 

[1039] per_up«fioortsum(nhood(i))**75); 

[1040] per_dn«ce!l(sum(nhood( x))*,2$); 

[1041] lf(per_up<=per_dn) 

[1042] per_up=surn(nhhod(i))i 

[1043] per_dn«l; 

[1044] end 

[1045] 

[1046] thejmg^injmg; 
[1047] 

[1048] for(ch«l;nchannels)» 

[1049] grad( ; , ; t ch)=(ordfilt2(the Jmg( : » ; »ch),per„up t nhood)> 

ordfilt2(theJrng( : , ; ,ch),per_dn,nhood)); 

[1050] end 
[1051] 

[1052] thr_edge=,085; 
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[1053] thrjr»oise-*02; 

[1054] edgej9rad-(grad> thr^edge); 

[1055] shape_grad=((grad<= thr„edge)&(grad>=* thr_noise)); 

[1056] noise_g rad = ( (g rad < thr^noise) &(g rad > ■ thr„noise)); 
[1057] 

[1058] for(eh~l:nehanneis), 

[1059] mask_grad(;**,chMshape^rad(*,.,eh) & 

(filter2(nhood»edge_grad(:,i*ch))=-0))j 

[1060] end 
[1061] 

[1062] ssi=prod( steetinjmg)); 

[1063] 

[1064] 

[1065] grad_veet(l)~sum{edge_grad( :))/sst; 

[1066] g rad_veet(2) =su m(shape_g rad( : ) )/s$i j 

[1067] gradjtfect(3)=sum(noise_jgrad(*))/ssi; 

[1068] gradjtfeetW^sumCmask^gradO^/ssi; 

[1069] return 

IV.L* — Caching and Smart Forwarding 

[1070] Certain distribution circumstances in MMS lead to situation 
where the same content is sent to a large number of devices with similar 
or identical characteristics* Examples are; 

[1071] A VAS (e*g* a stock quote provider) sends an update to 

thousands of subscribers at the same time - in this situation hundreds of 
them will have identical phones and therefore media conversion should 
not be repeated for each one* 

[1072] Superdistribution - a user gets a funny message and forwards it 
to his buddies, who then send it onward to their friends etc. Since some of 
the recipients may use the same device* there is no need to keep 
converting the message again and again* Furthermore, the degradations 
caused by continuous conversion should and can be avoided* See for 
example figure 32^ the 2nd recipient has a color enabled display, and 
hence should get a color image/video* This can only happen if smart 
forwarding is implemented* 

[1073] In short, caching means that when a new 

transcoding/conversion request arrives, the MPS looks in the cache to see 
if an identical/practically identical request for transcoding of the same 
media object into an identical/practically identical device has been 
submitted in the past and if the result of this operation is still in the cache* 
If so* the URI of the object in the cache is returned as the transcoded 
result and the actual transcoding operation is avoided* 

[1074] 

[1075] Smart forwarding is similar to regular caching* except that the 
MPS retrieves the cached transcoded result based on the original media 
object* That is, if a content request for object "B* (originally transcoded 
from object "A*) to device "V is requested, the MPS retrieves the cached 
result of "A* transcoded into *T\ not of *B* transcoded into T. 
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[1076] Other modifications and variations to the invention will bo 
apparent to those skilled in the art from the foregoing disclosure and 
teachings. Thus, while only certain embodiments of die invention have 
been specifically described herein, it will be apparent that numerous 
modifications may be made thereto without departing from the spirit and 
scope of the invention. 
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WHAT3tSO.AIMEP.SS 

1. An MMS communication system for displaying images on a display 
terminal of a mobile or portable communication device, the system 
comprising: 

an input adapted to receive pre-souree information; 

a transmitter adapted to transmit the pre-source information; 

a server adapted to receive the transmitted pre-souree information 

and further adapted to convert the pre-source information to source 

information suitable for display on the display terminal; and 

a source transmitter adapted to transmit the source information to the 

display terminals 

2. The system of claim 1, wherein the server further comprises a 
display characteristics identifier adapted to identify display 
characteristics of the display terminal, 

3. The system of claim I, wherein the server further comprises a pre- 
source transcoder adapted to transcode pre-source information to 
source information. 

4. The system of claim 1, wherein the served comprises at least one 
source transcoder adapted to transcode a first source information in a 
first format to a second source information in a second format. 

5. The system of claim 4, wherein the second format corresponds to a 
display format for the display terminal, 

6. A method for enabling communication of MMS information suitable 
for display on a mobile or portable communication terminal, 
comprising: 

inputting p re-source information; 

transcoding p re-source information into source information; 
transcoding source information another source format suitable for 
display on the target display terminal; 

transmitting the transcoded source information to the target display 

displaying the transmitted and transcoded source information on the 
display space of a target display terminal in a format most suitable for 
that particular display terminal. 
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Figure XX: Watermarking algorithm for B/W images (picture messages, WBMP) 

Multilayer Input 
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Figure XX: Layer reduction and its effect on the message appearance 
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Figure XX: content based conversion for cartoons (B&W Line-; 
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Figure: cases where caching and smart-forwarding are required. 
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Specifications 

Implementation: The caching and smart forwarding system is based on a web-caching 
mechanism - converted and original media objects are stored in a specially configured 
set of web-caches, and are retrieved by URL The basic operation scenario is depicted 
in the figures below. 

MMSC 

CLIENT I 1 




Figure: Sequence of operations for a cached transaction 

Algorithmic Cache Sequence 



• For incoming object A (after SMF) + filtered Target Parameters 
calculate MD5 "C" plus target MIME type "D". 

• Perform HTTP HEAD request to check if in cache, request filename is 
"CV'D". 

• If not in cache, perform transcode, perform HTTP PUT for transcode 
result in cache. 

• Return URI. 

Figure: sequence of operations for caching 
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hy 3f 



Algorithmic 



SMF Sequence 



• For incoming object A calculate MD5, extract internal hash "C" MIME 
type "D" from SQL database (indexed query). 

• If new object, put in database as original content, perform HTTP PUT 
for original object in cache. 

• If existing object, perform HTTP HEAD request to check if in cache. 
Object filename is "C'V'D". Update last requested counter. 

• If in cache, return URI. 

• If not in cache, put the new input object A as file with the original 
content hash value "C" as the file name. 

Figure: sequence of operations for smart-forwarding 



♦ Any SQL database can be used, e.g. MySQL. 

• Single indexed table (index on key value)/ double indexed table (last 
requested update). 

♦ Periodic cleanup required of least accessed objects. 

* Table can reside in RAM - e.g. for 10 Million cached messages, less 
than 4Gb of RAM required. 

♦ Can be implemented as an indexed linked list with no database if 
database performance issues arise. 

• Database scaling is simple based on hash value partitioning (e.g. 
server number 1 handles hash values less than TBD etc.). 

Figure: Hash database connecting transcoded objects and original objects 



Hash Database 
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Figure XX: MM1 message interface to the UCnGO MCS 
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Figure XX: MM7 message interface to the UCnGO MCS 
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S: <waitfor connection on TCP port 25> 

C: <open connection to server> 

S: 220 mtnsc.ucngomcs.net Simple Mail Transfer Service Ready 

C: EHLOcnn.com 

S: 250 mmse.ucngomcs.net 

S: 250 SIZE 150000 

C: MAIL FROM: <suUUtaP SIZE=50000 

S: 250 OK 

C: RCPT TO: < +3508401234567(gmmsc.operator.net > 

S: 250 OK 

C: DATA 

S: 554 Start mail input; end with <CRxLF>.< CR><LF> 

C: Date: 01 November 2001 00:00 EST 

C: from: . com 

C: to: +350340 1 2M567@inrtisc. operator.net 

C : subj e ct: N e ws c ast with text an d animat e d gj£ 

C: Content-type: multipart/mixe d 

C: X-fete-Message-Class: personal 

C: X-M^ r Expiry: 05 "November 2002 10:00 EST 

C: X-MSKrdelivery-time: 05 November 2002 09:00 EST 

C: X-J4^-Priority: Normal 

C: ^_=_PartJ - .9840522.997734797Sfi3 

C: Content-Type: text/plain; ^MfeMM 

C: Unisys launches MMS video services in Portugal ... 

C: =JartJL9840522.997734797363 

C: Content-Type: image/gj£}3^^ 

C: Content-Disposition: attachment; SlHJSBl^^ 

C: =_Part_0_9840522.997734797363 

C: Content-Type: image/GIF 

C: Content-Transfer-Enco ding: b asefrt 

C: j I C Y M T Ay M Q CNklkagD E ONyfcM T cuMj A 5 LjkS L1RZUEU9SVB 2N AC P gJ E 3 Lj 

C: bmcgdGHLICJQcmlvcml0eSIgZmLLbGQuIEL0IGLzIHBldCBhcyAiTG93Ii4NC 

C: g== 

C: =_Part_0_9840522.9977347973d3 

C: <CRxLF>.< CRxLF> 

S: 250 OK 

C: QUIT 

S: 221 mmscucngomcs.net Service closing transmission channel 
Figure: Sample MMS submission for transcoding from a VAS through MM7 
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